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Preface 



Organization of the Book 

This book has been conceived to progressively introduce concepts of increasing 
difficulty and allow learning of theories and control methods in a way which is 
not too brutal. It contains different levels of reading (Figure 1). In particular, 
the majority of the first part can be undertaken by students beginning in con- 
trol or by technicians and engineers coming from the industrial world, having 
up to that point only practical contact with control and a desire to improve 
their knowledge. The subsequent parts need a minimum previous knowledge 
in control. They also allow the use of often higher-performance techniques. 
Without pretending to be exhaustive, this book proposes a wide range of iden- 
tification and control methods applicable to processes and is accompanied by 
similar typical examples that provide comparison elements. 

This book does not pretend to compete with theoretical control books spe- 
cialized in one subject or another, e.g. identification, signal processing, mul- 
tivariable control, robust control or nonlinear control. However, the readers 
whoever they are, undergraduate or graduate students, engineers, researchers, 
professors, will find numerous references and statements that enable under- 
standing and application in their own domain of a large number of these con- 
cepts, taking their inspiration from the cases treated in the present book. 
Several controls are examined under different angles: 

• single-input single-output internal model control in continuous and discrete 
time, multivariable internal model control in discrete time, 

• pole-placement control in continuous and discrete time, 

• single-input single-output linear quadratic control by continuous transfer 
function and multivariable state-space linear quadratic control in continuous 
and discrete time, 

• single-input single-output generalized predictive control, multivariable model 
predictive control, possibly with observer, linear or nonlinear. 

The consideration of the same problem by different approaches thus enables 
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a thorough examination. 

Examples are most often taken from the chemical engineering domain, and 
include in general chemical, biological and polymerization reactors, a catalytic 
cracking reactor (FCC) and distillation columns. These examples are detailed, 
even at the numerical level, so that the used reasoning can be verihed and taken 
again by the reader. Simulations have been realized by means of Matlab© and 
Fortran?? codes. 
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Figure 1: General organization of the book 



Part I concerns single-input single-output continuous-time control. The cho- 
sen presentation of single-input single-output feedback linear process control is 
classical and voluntarily simple for most of the part. It presents the pedagog- 
ical advantage of decomposing the approach to a control problem, introduces 
a given number of important notions and, according to our opinion, facilitates 
the understanding of discrete-time control, called digital control, and of non- 
linear control. Also, it is close, in its conception, to a large part of industrial 
practice, at least in the domain of chemical engineering. Continuous PID con- 
trol is abundantly treated, but without exclusivity. The main types of dynamic 
models met in chemical engineering are commented on and system models are 
presented as well in state space as transfer functions (Chap. 1). Control is 
hrst simply related to the PID controller (Chap. 2). Stability is presented 
both for linear and nonlinear systems. Thus, the stability of a polymerization 
reactor is detailed by displaying the multiple stationary states in relation to 
the physical behaviour of the reactor (Chap. 3). The design of controllers 
hrst deals with PID, then is broadened to internal model control, which is very 
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important industrially, pole-placement control and linear quadratic control by 
means of continuous transfer functions (Chap. 4). Frequency analysis begins 
classically by the analysis in Bode and Nyquist representations, but is then 
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extended to robustness and sensitivity functions (Chap. 5). The improvements 
of controllers including time delay compensation, cascade control and feedfor- 
ward control are reviewed with application examples in industrial processes 
(Chap. 6). The hrst part hnishes with the concepts of state representation for 
linear systems, controllability and observability (Chap. 7). Some more difficult 
parts of different chapters such as robustness, pole-placement control or linear 
quadratic control can be tackled in a subsequent reading. 

Part II consists of only one chapter which deals with multivariable control by 
either continuous or discrete transfer functions matrix. This choice was made 
because of the relatively common practice of system representation chosen in 
process control. The chapter essentially presents general concepts for the design 
of a multivariable control system. Other types of multivariable control are 
treated in specific chapters in other parts of the book: linear quadratic control 
and Gaussian linear quadratic control in Chap. 14, model predictive control in 
Chap. 16 and nonlinear multivariable control in Chap. 17. In fact. Part III 
can be studied before Part II. 

Part III begins by considering signal processing whose general concepts are 
necessary in identification and control. Then, the general aspects of digital 
control and sampling are treated and discrete transfer functions are introduced 
(Chap. 9). The remainder of Part III is devoted to discrete-time identification. 
First, different model types are presented and the principles of identification are 
explained (Chap. 10), then different types of models are presented (Chap. 11). 
Lastly, the main algorithms of parametric identification are detailed with many 
indications on usage precautions (Chap. 12). The parametric identification of 
a chemical reactor is presented. Identification is treated as single-input single- 
output except with respect to the Kalman filter, which can be applied to multi- 
input multi-output systems. 

In Part IV, several classical types of digital control are studied. Chapter 
13 describes pole-placement, digital PID and discrete internal model control as 
single-input single-output control with application to the same chemical reac- 
tor. In Chap. 14, optimal control is considered in the general framework of 
dynamic optimization applicable to nonlinear continuous or discrete systems; it 
includes general methods such as variational methods, Euler, Hamilton-Jacobi, 
Pontryagin and dynamic programming. Linear quadratic and linear quadra- 
tic Gaussian control is presented in direct relation to optimal control both for 
continuous and discrete state-space system descriptions. An application of mul- 
tivariable linear quadratic Gaussian control to an extractive distillation column 
with two inputs and two outputs is presented. Two types of predictive control 
are studied. Chapter 15 concerns generalized predictive control for single-input 
single-output systems represented by their discrete transfer function with ap- 
plication to the previously mentioned chemical reactor. Chapter 16 is devoted 
to model predictive control applicable to large multivariable systems known by 
transfer functions or state-space models. Furthermore, model predictive control 
is popular in industry as it allows constraints to be taken into account. Two 
multivariable applications for a catalytic cracking reactor (FCC) are shown. 

Part V concerns nonlinear control presented through differential geometry 
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(Chap. 17) and state observers (Chap. 18). These are recent developments in 
control, and are potentially very powerful. To facilitate its approach, several 
concepts are analyzed from a linear point of view, then nonlinear control for a 
single-input single-output system is studied with input-state and input-output 
linearization. Nonlinear multivariable control is just outlined. State estimation 
is necessary in nonlinear control. Chapter 18 on observers does not concern the 
linear Kalman hlter described in Part III, but considers nonlinear observers 
including the extended Kalman hlter and the high-gain observer, as well as 
statistical estimators. 

Part VI considers two important classes of chemical processes: reactors 
and distillation columns. In the previous parts, linear identihcation and linear 
control are applied to the chemical reactor described in detail in Chap. 19. In 
this hrst chapter of Part VI, the use of geometric nonlinear control, based on the 
knowledge model of the process or coupled with a state observer, is explained 
for a chemical reactor and a biological reactor. It must be noticed that this 
is the same simulated chemical reactor that was used for identihcation of a 
linear model and several discrete linear control methods. Chapter 20 sweeps 
the control methods used in distillation since the 1970s until our epoch that is 
marked by the start of industrial use of nonlinear control. Chap. 21 describes 
different processes and benchmarks that can be used as more or less complicated 
examples to test various control strategies. 
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Dynamic Modelling of 
Chemical Processes 

1.1 References 

Many textbooks and research books are available. When a precise reference is 
given for a textbook which describes a given topic particularly well, the present 
author has tried not to forget his colleagues and mentions them at the end of the 
concerned chapter. When general textbooks can be recommended for different 
points, they are mentioned at the end of this first chapter to avoid too much 
repetition. It is impossible to cite all textbooks and the fact that some are not 
cited does not mean that they are of lower value. Of course, research papers 
are referenced in the concerned chapter. 



1.2 Applications of Process Control 

A chemical plant represents a complex arrangement of different units (reactors; 
separation units such as distillation, absorption, extraction, chromatography 
and filtration; heat exchangers; pumps; compressors; tanks; . . .). These units 
must be either maintained close to their steady states for continuous operation 
or follow optimal trajectories for batch operation. 

The engineers in charge of a plant must ensure quantitative and qualitative 
product specifications and economic performance while meeting health, safety 
and environmental regulations. 

The task of a control system is to ensure the stability of the process, to 
minimize the influence of disturbances and perturbations and to optimize the 
overall performance. These objectives are achieved by maintaining some varia- 
bles (temperature, pressure, concentration, position, speed, quality, . . .) close 
to their desired values or using set points which can be fixed or time-dependent. 

When a chemical engineer designs a process control system, he or she must 
first study the process and determine its characteristics. The process variables 
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are classified as inputs, outputs, and states. 

Subsequently, a process model with varying degrees of complexity (accord- 
ing to the ultimate use of the model) is derived. For an existing process, a black 
box model (where coefficients have no physical meaning) may be developed by 
system identification techniques, often with little effort. On the other hand, 
a physical model based on first principles involves a great deal of engineering 
effort. 

The developed model is in general verified in an off-line manner and inde- 
pendently of the control scheme. Then, the model is used together with the 
chosen control scheme to check the process response to set point and distur- 
bances variations. 

The control scheme can be a simple proportional controller as well as a 
much more sophisticated algorithm, e.g. if nonlinear control based on a physical 
model of the process is utilized. 

The outputs of the simulation model used for control can be compared to 
those of the real process or to those from an accurate model. System identifi- 
cation (Fig. 1.1) allows the engineer to estimate the parameters of the model 
or to evaluate the performances of the control law. 




Figure 1.1: Design of a process model for control 



1.3 Process Description from the Control 
Engineer’s Viewpoint 

The control engineer considers the process to be controlled as a dynamic system 
having inputs, outputs and internal variables called the state variables. His/her 
classification is different from the process engineer’s point of view. Indeed, 
for a process engineer, inputs are essentially physical streams (such as a feed 
pipe) delivering material and energy to the process, possibly information such 
as electrical signals, and outputs are similarly physical streams, withdrawing 
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materials (the effluent stream to the downstream processing unit) and energy 
from the process. 



External disturbances 
Measured Unmeasured 



Manipulated 

variables 



1 Measured outputs 

Process 

(states) Unmeasured outputs 



Figure 1.2: Input-output block diagram representation of a process 

From the control engineer’s point of view, variables associated with a process 
(flow rate, concentration, temperature, pressure, quality, . . .) are generally con- 
sidered as signals transferring information. These variables are divided into two 
groups (Fig. 1.2): 

— Inputs which represent the influence of environment on the process: these 
variables affect the process and thus modify its behaviour. 

— Outputs which represent the process influence on environment: these varia- 
bles represent the link with the outside. They should be maintained close 
to their set points. 

Input and output variables are linked by state variables (see state representa- 
tion) which are internal to the process and help to describe the evolution of 
the process with time. Any modihcation of the inputs affects dynamically the 
process states, which in turn influence algebraically the process outputs. 

Input variables are divided into: 

— Control variables or manipulated variables which can be adjusted freely by 
the operator or by regulatory means. For example, the position of a valve 
stem determines the flow rate through the valve. In this case, the input or 
manipulated variable will be the valve stem position or the flow rate in the 
pipe which is directly related to the valve position. 

— Disturbances include all other inputs which are not set by the operator or 
a controller. For example, in the case of temperature control in a building, 
outside climate temperature and humidity changes are considered as distur- 
bances which affect the controlled variable that is the inner temperature. 

Output variables are divided into: 

— Measured variables, whose values are known using direct on-line measure- 
ment by a sensor. 

— Unmeasured variables, which may be estimated or inferred using an indirect 
or secondary measurement. This estimation of unmeasured variables by 
means of a model and other measurements constitutes a soft sensor (Chap. 
18). 
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Figure 1.2 represents an input-output block diagram of a process. 

A system is called single variable or single-input single-output (SISO) if it 
has only one input and one output. It is called multivariable or multi-input 
multi-output (MIMO) if it has several inputs and several outputs. In general, 
the number of inputs is larger than the number of outputs. 




0 5 10 15 20 25 30 35 40 45 50 




Figure 1.3: Different types of inputs starting at time t = 7 after steady state 

In order to study the behaviour of a process, the process inputs are generally 
varied by some simple and standard functions and the response of the process 
is monitored in each case. Apart from the disturbances, which can take any 
form, the standard input functions / (Fig. 1.3) are: 

— A step function: a unit step function is defined as / = 1 if t > 0 , / = 0 if 
t < 0. Its response is called a step response. 

— An impulse function: a unit impulse function is defined as: f = S (theore- 
tical Dirac). Its response is called an impulse response. 

— A sinusoidal function: f = a cos{iot + (/)). Its response is referred to as a 
frequency response. 

— A ramp: / = kt. This determines the behaviour of the process output to 
an input with constant rate of change (constant velocity). 

— A parabolic function: / = kt^. This is used whenever the response to a 
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constant acceleration is desired. 

The inputs to a controlled physical system can be further classihed as dis- 
turbances (loads) (for regulatory control) or set point variations (for set point 
tracking or servo-control). 



1.4 Model Classification 

Models can be classihed with respect to different user-specified criteria. In the 
steady-state models, the time derivatives of the state variables are set to zero 
{d/dt = 0). In the dynamic models which describe the transient behaviour of 
the process, the process variables are time-dependent (/(t, x, ix)). In process 
control applications, the models must be dynamic in order to represent the 
process variations with respect to time. 

Dynamic models can be of two kinds: deterministic, in which it is assumed 
that all the variables are perfectly known at a given instant of time, or prob- 
abilistic (stochastic) models, which make use of probability distributions to 
account for the variations and uncertainties associated with the process and its 
variables. 

Dynamic models can be continuous when the function /(t, x, u) describing 
the process is continuous or discrete with respect to time (variables are only 
known at regular time intervals). 

A model can be developed using merely the process input-output data series 
without physical knowledge of the process. This type of model is referred to 
as a black-box or behavioural model (such as neural networks). At the other 
extreme, a model may be developed from the application of first principles 
(conservation laws) to the process. Such models are called phenomenological 
or knowledge-based. 

The knowledge-based models are further classified as: 

— Lumped-parameter models in which the state variables have no spatial de- 
pendence, and therefore the models consist of ordinary differential equations 
(e.g. the classical continuous stirred tank reactor). 

— Distributed-parameter models, in which the state variables are position-de- 
pendent and the models take the form of partial differential equations (e.g. 
a tubular reactor). Very often, a distributed-parameter model is discretized 
(division of a tubular reactor into n continuous stirred tank reactors) so as to 
transform the partial differential equations into a set of ordinary differential 
equations that are more tractable mathematically. 

Furthermore, models can be linear or nonlinear. A process model is linear if all 
variables describing the process appear linearly in the equations. It is nonlinear 
in the opposite case. The advantage of linear models is that they can be easily 
transformed by mathematical mappings and their mathematical behaviour is 
well known. 

Among the many forms of models that will be used in this book, the transfer 
functions (in the continuous 5 Laplace or discrete z domains) and the state- 
space models are of special significance. 
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A key point which must be remembered is that a model, however sophis- 
ticated it may be, remains an approximation of the real process. It will often 
differ according to the pursued objective. 



1.5 State-Space Models 



Generally, a state-space multivariable system is modelled by a set of algebraic 
and differential equations of the form 



x= f{x,u,t) 
y= h{x,t) 



where x is the state vector of dimension n, u is the input vector (or control 
variables vector) of dimension riu and y is the output vector of dimension Uy 
(in general, riu > Uy). The state x of the system at any time t is described 
by a set of n differential equations, which are often non-linear. The states x{t) 
depend only on initial conditions at to and on inputs u{t) between to and t. 
At the initial time, the state variables are the initial conditions and later they 
represent the evolution of the state of the system. 

A remarkable characteristic of the non-linear dynamic knowledge-based mo- 
dels in chemical engineering is that they are described by differential equations, 
either ordinary or partial, which are usually affine with respect to the input 
vector. Thus, the most common form of a non-linear state-space model is 

f X = f{x,t) +g{x,t)u 

{ ( 1 - 2 ) 

1 y = h{x,t) 

An important class is linear state-space models of the form 



x{t) = Ax{t) + Bu{t) 
y(t) = Cx(t) + Du(t) 



(1.3) 



which includes any process model described by a set of n linear ordinary diffe- 
rential equations. A is the state matrix of dimension (n x n), S is the control 
matrix of dimension (n x n^^), C is the output matrix of dimension {uy x n), 
and D is the coupling matrix of dimension {uy x riu) which is very often equal 
to zero. When D is different from zero, it is said that the output is directly 
driven by the input. 

In some cases, it is necessary to model a linear system under the following 
generalized state-space representation or descriptor form 



Ex(t) = Ax{t) + Bu{t) 
y(t) = Cx(t) + Du(t) 



(1.4) 



where E is called the descriptor matrix of dimension (n x n). If E is invert- 
ible, the system can be rewritten under the form (1.3). If is singular, this 
corresponds to an differential- algebraic system. 
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The linearization of a set of differential equations of the most general form 

x = f{x,u,t) (1.5) 

around a steady-state operating level Xq for a nominal input no can be achieved 
by the following Taylor series expansion 

±i = fi{xo,uo,t) + ^(^) + Suk 

7 = 1 y X=Xo,U=Uo / X=Xo,U=Uq 

( 1 . 6 ) 

with: 5xj = Xj —Xj^o^ Suk = Uk — Uk^o where Xj^o is the j component of steady- 
state vector xo and is the k component of the nominal input vector no- S 
indicates any deviation with respect to the nominal operating condition, thus 
Sx and Su are respectively deviations of the state and of the input with respect 
to the steady state of the process. One gets Xi = Sxi + Xi^o and Xi^o = so 
that the following set of linear ordinary differential equations is obtained 

j = l y X=Xo,U=Uo k=l ^ ^ y X=Xo,U=Uo 

which can be easily written in a more condensed matrix form similar to Eq. 
(1.3). Similarly 

j = l y X=Xo,U=Uo y X=Xo,U=Uq 



The form of Eqs. (1.7) and (1.8) is referred to as the linearized state-space 
model with the matrices of the linear state-space model indicated by their 
current element 



A and B are respectively the Jacobian matrices of / with respect to x and n, 
while C and D are respectively the Jacobian matrices of h with respect to x 
and u. 

Notions of controllability and observability in the state space will be spe- 
cially studied in Chap. 7. 



1.6 Examples of Models in Chemical 
Engineering 

With the help of certain classical examples in chemical engineering, we will 
show how various chemical processes can be described in the state-space form. 
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1.6.1 Lumped- Parameter Systems 

In lumped-parameter systems, the process variables depend only on time, which 
is the independent variable. The dynamic behaviour of the process is then 
described by a set of ordinary differential equations. 

A Surge Tank 

Consider a cylindrical tank fed by an incompressible liquid (Fig. 1.4) at a 
varying flow rate Fq (m^/s). The exit flow rate F is also time-dependent. 



Fo 











h 





Figure 1.4: A surge tank with varying level 

At steady-state, the level in the tank is constant and the mass balance requires 
equality of inlet and outlet mass flow rates 

FoPo = Fp (1.10) 



where p is the fluid density. 

In transient regime, the liquid level h in the tank varies with time. 
The general mass balance formulated as 

(inlet mass / unit time) = (outlet mass / unit time) + 
(time rate of change of mass in the system) 

gives the dynamic model of the tank 



Fq po 



F pF 



djpV) 

dt 



( 1 . 11 ) 



This is the state-space equation of the process. 

If the liquid density is constant {p = po) and the tank cross-sectional area 
S does not depend on the liquid level, the previous mass balance will become 



dh 

dt 



Fo/S - F/S 



( 1 . 12 ) 



One notices that only the liquid level in the tank, the controlled variable, 
appears in the derivative. With the section area S being a constant parameter. 




Process Control 



11 




Figure 1.5: Surge tank with varying level with valve on the inlet stream 



this ordinary differential equation is linear. Equation (1.12) can be considered 
as the fundamental model of a level control system. 

Assuming that a valve is placed on the inlet pipe (Fig. 1.5), the inlet ffow 
rate Fq will become the control (manipulated) variable of the system. In state 
space, the system is single-input single-output (SISO), with an input n = Fq, 
an output y = h and only one state: x = h. The state-space model is 

X= u/S-F/S 

y = X \ ' J 

In this linear model, the cross-sectional area 5" is a constant parameter, and 
the ffow rate F is an external disturbance acting on the system. 

Fo 











h 







Figure 1.6: Surge tank with varying level with valve on the outlet stream 

In the case where the valve is on the outlet stream (Fig. 1.6), the manipu- 
lated input is the outlet ffow rate F, so that the state-space model becomes 



x= Fo/S-u/S 
y= X 



(1.14) 



In this case, the ffow rate Fq is a disturbance. 
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An Isothermal Continuous Chemical Reactor 

The cylindrical reactor (Fig. 1.7) is assumed to be perfectly mixed, i.e. the 
temperature T, the concentration of any given species, the pressure, etc. are 
identical at any location of the reactor and keep their value in the effluent 
stream. The reactor is fed by two streams, one having a flow rate Fq contai- 
ning the reactant A, and the other one an inert stream with a flow rate F\. 
Both streams have their temperature equal to that of the reactor contents. Fur- 
thermore, we will assume that the heat of reaction is negligible so that there is 
no need to write the energy balance. 



Fq , po , T Fi , Pi , T 




Figure 1.7: An isothermal continuous stirred tank reactor (CSTR) 

The overall mass balance equation is similar to that of the previous tank 

^^ = Fopo/S + Fipi/S-Fp/S (1.15) 

and we assume that the densities in different streams are equal. 

The balance for component A in transient regime is given by the continuity 
equation for A: 

(rate of A entering) = (rate of A exiting) — (rate of A produced) + 
(rate of accumulation of A), 



giving 

FoCao=FCa-RaV+^F^ 



(1.16) 



noting that Fko = Fo Cao is the molar flow rate (mol/s) of component A in the 
inlet stream, and similarly is the outlet molar flow rate. Equation (1.16) 
can be written as 



FL = n-RAVF 



djVCA) 

dt 



(1.17) 



The term Ra represents the number of moles of A produced per unit volume 
and unit time; it can be called the production rate of A (Levenspiel, 1999; 




Process Control 



13 



Villermaux, 1982). When R reactions designated by i occur simultaneously, 
the rate of production of a component Aj is equal to 

R 

(1-18) 

i=l 



where each reaction rate is in general positive. The stoichiometric coefficient 
i>ij > 0 if Aj is produced by reaction and i>ij < 0 if Aj is consumed by reaction 
i. Note that this dehnition of reaction can be applied to either a continuous 
reactor, a batch reactor (Fq = F = 0) or a fed-batch reactor(F = 0 and 
Fo 7^ 0). The chemical advancement (dimension: mol) and the generalized 
yield y (without dimension) are dehned such that: 

• For a closed reactor: 

Taking no as the total number of moles of reacting species present at a refe- 
rence state, in general the initial state, the number of moles of a component 
Aj at any other state is equal to 

R R 

rij = rijo + Vij = rijo + F = F V*) (1-19) 

i=l i=l j 



given the rate of reaction i: 



V dt ~ V dt 



( 1 . 20 ) 



• For an open continuous stirred reactor: 

The reference is in general the inlet molar flow rate Fq including all entering 
reacting species j present in the reference state. So the molar flow rate Fj 
of a component Aj at any point of the system is equal to 






'ij Xi 



i=l 



i=l 



with:F' = y^q'o (1.21) 

3 



given the rate of reaction i: 



y/CiyOut 



Fo Co , , 

\Xi,out Xiyin) 



( 1 . 22 ) 



where in denotes the inlet stream and out the exit stream. Co is the total 
concentration in the reference state of all the constituents. The residence 
time r is equal to 



^ ^0 (yXi,out Xi,in) 

Fo Vi 



(1.23) 



In the general case of R simultaneous reactions. Equation (1.16) becomes 



5^ 



d{hCA) 



R 

Fo Cao — F Ca + F 

i=l 



(1.24) 



dt 
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In the case where only one first-order chemical reaction occurs: A ^ the 
reaction rate is equal to ta = and the production rate of A is equal 

to Ra = —ta^ as UiA = — 1. Equation (1.16) becomes 

FoCAo=FCA+kCAV+ (1.25) 



which can be transformed into 

^ = ( 1 . 26 ) 

This balance will be used as the fundamental model for the control of the 
concentration Ca- 

The differential equation describing the variations in the concentration Ca 
is in general nonlinear, since the inlet and outlet flow rates Fq and F are time- 
varying and the reaction rate can be a complicated function with respect to 
concentration. Frequently, a chemical reaction is either endothermic or exother- 
mic. Therefore, an energy balance equation should be added to the previous 
differential equations. 

Assuming that the inputs are the inlet volumetric flow rate Fi and inlet 
concentration Caq^ Ihe control vector is n = [Fi,(Taq]^. We wish to control 
the reactor level and concentration, thus the output vector is y = [/i, 

The process is a 2 x 2 multivariable system. The state vector is chosen to be 
equal to = [/i, The state-space representation of the isothermal reactor 

with a first-order reaction is 



±1 = 


Fo/S-F/S + ui/S 




±2 = 


[Fo{u2 X2) X2Ui] kX2 


(1.27) 


yi = 


X\ 




V2 = 


X2 





This multi-input multi-output (MIMO) model is nonlinear even if we assume 
that the level h is perfectly regulated by an independent controller because of 
the differential equation describing the concentration variations. Disturbances 
are flow rates Fq and F. 

Frequently, when the main objective is concentration control, the influence 
of level variations is considered as secondary and the level can be regulated 
independently. 

A Non-isothermal Continuous Chemical Reactor 

Figure 1.8 represents the schematics of a non-isothermal continuous chemi- 
cal reactor with a heating/cooling jacket. The heating/cooling medium may 
also be supplied by means of a coil immersed inside the reactor. It is used for 
cooling in cases of exothermic reaction or desired temperature decrease, and 
for heating in cases of endothermic reaction or desired temperature increase. 
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The rate of heat transfer transferred between the reacting mixture and the 
heating/cooling medium is Q. Q is positive for heating the reacting mixture 
and negative in the opposite case. Q is given by 

Q = USex{Tj-T) ( 1 . 28 ) 

where U is the overall heat transfer coefficient, /Sex is the available heat ex- 
change area, Tj is the mean temperature in the jacket, and T is the temperature 
of the reactor. 



Fo , po , ^0 Fi , Pi , Ti 




Figure 1.8: A non-isothermal continuous stirred tank chemical reactor 

(CSTR) 

Apart from the overall mass balance and component mass balance on A, the 
energy balance written in general terms as 

(variation of internal energy / unit time) = 

(inlet enthalpy by convection / unit time) — 

(outlet enthalpy by convection / unit time ) + 

(rate of heat transfer and mechanical energy) 

must also be considered. The energy balance of the reactor is thus as follows 

~ ^in ~ ^out ^out + Q ( 1 . 29 ) 

where is the total inlet molar flow rate and is the total outlet molar 
flow rate, h' is the specihc molar enthalpy of each stream given by 

^in ^ ^ ^j,in ^j^jn ^out ^ ( 1 . 30 ) 

j 3 

where Xj^in and Xj^out are the inlet and outlet mole fractions respectively and 
^ and are the specihc enthalpies of component j in the inlet and outlet 

streams respectively. The specihc molar enthalpy h' of a pure component can 
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be expressed with respect to its enthalpy of formation AHf{Tref) at a reference 
temperature T^ef according to 



h' = AH f{Tref)^ [ C'pdr 



(1.31) 



provided that there is no change of state between T^ef and T. If, however, there 
is a change of state, the corresponding latent heat of transformation should be 
accounted for. is the molar specific heat of the component at constant 
pressure. 

The change of total internal energy U of the reactor contents can be ex- 
pressed in terms of the specific molar internal energy by 



dU d{nu) d{H-PV) d{nh) ^d^ ^ d^ 

dt dt dt dt dt dt 

For isobaric operation and negligible pressure work, there remains 

m _ d(nh) 
dt dt 

The enthalpy change Ahi due to reaction i is equal to 



(1.32) 



(1.33) 



^hi = ^ 

j 



Vij hj 



(1.34) 



which concerns only reacting and produced species, not inert ones. For R 
reactions, the total enthalpy contribution dh linked to the reactions is 



R 

dh = Vdt ri Ahi 

i=l 



(1.35) 



The enthalpy of the reactor walls and its accessories such as the mixer 
should also be considered. Assuming that the mass of the reactor wall and its 
accessories is represented by and the corresponding heat capacity is (F^, the 
overall thermal balance after some mathematical manipulation can be written 
as 



^ 

(m Cp + 'ITT-r Cr) — = Fin Pin Cp,in {Tin ~ T) + Q - V ^ r j Ahi (1.36) 

in i=l 

where m is the mass of the reactor contents, Cp its mean specific heat calculated 
at the reactor temperature T, and the summation is carried out over all inlet 
streams to the reactor. Note that the reactor is perfectly mixed, therefore the 
exit temperature and concentration are identical to those in the reactor. The 
heats of reactions Ahi are calculated at the reactor temperature. 

Consider again the first-order chemical reaction A ^ B with heat of reaction 
AhA^B^ taking place in the reactor shown in Fig. 1.8. The temperature 
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dependency of the reaction rate is expressed by the Arrhenius equation 
r = ko e-xp{—E/{RT)) Ca, which is a highly nonlinear term. When applied to 
the reactor shown in Fig. 1.8 with its two inlet streams, Eq. (1.36) becomes 
(assuming that densities po, Pi, P and heat capacities are constant) 

dT 

{m Cp + TTir Cr) = Eq po Cpo (Tq — T) + Fi p\ Cp\ (Ti —T)^Q — V r AHa^b 

(1.37) 

where Cpo is the mean specihc heat of the inlet stream with volumetric flow 
rate Fq. The densities are assumed to be identical and constant. 

Notice that the differential equation describing the temperature variation 
is nonlinear. This model will be used for temperature control studies in the 
remainder of the book. 

The behaviour of the chemical reactor shown in Fig. 1.8 is described by a 
set of three coupled ordinary differential equations. 

In addition to the level and concentration in the case of the isothermal 
reactor, temperature must also be controlled. Moreover, it is assumed that a 
valve allows us to manipulate the thermal power Q introduced in the jacket. 
This can be performed by a heat exchanger. Therefore, the output vector is 
y = [P, (F^,T]^, the control vector is n = [Fi, and the state vector 

is = [P, Ca^ T]^ . 

In this case, the state-space model obtained from balance equations is as 
follows 

^±1= F^/S-F/S^ui/S 

±2 = [Fo{u2 -X 2 ) -X 2 U 1 ] -Po exp(-F/(Fx 3 ))x 2 

^3 = — ^ yr [^0 Po Cpo (To — X3) T ui pi Cpi (Ti — X3) 

P p X I TTlj'p f 

I — Po exp(— F/(F x 3)) /Sxi X2APT + 1^3] 

yi = xi 

V2 = X2 

. P3 = ^3 

(1.38) 

The state-space model is multi-input multi-output and nonlinear because of the 
two differential equations describing respectively the variations of concentration 
and of temperature in the reactor. Nevertheless, this model is affine with 
respect to manipulated variables. 

In general, this model will be too complicated for control system design 
and analysis. If one is primarily concerned with temperature control, only the 
energy balance will be taken into account. 

Staged Processes 

A tray distillation column is composed of a hnite number of stages, similar 
to a tray absorption column or a multi-stage mixer-settler for liquid-liquid 
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extraction. In a staged process, the overall mass balance, the component mass 
balance and the energy balance are applied to each stage separately. The 
full state-space model of a staged process is thus obtained by gathering the 
stage models and taking into account the relations between the stages and the 
environment. Therefore, the overall model of these processes consists of a large 
number of ordinary differential equations adequate for dynamic simulation but 
which pose difficult problems for control studies and implementation. The 
number of differential equations (the model order), however, can be reduced by 
efficient model reduction techniques to obtain approximate low-order models. 
The reduced model must keep the main dynamic characteristics of the original 
high-order model. An example of model reduction for a distillation column is 
given in Chap. 20. 



1.6.2 Distributed-Parameter Systems 



When process variables depend simultaneously on time and spatial variables, 
the process is described by partial differential equations. 

A chemical tubular reactor, chromatography column, packed absorption or 
distillation column are examples of distributed-parameter systems. Only one 
example will be presented here: an isothermal tubular reactor (Fig. 1.9). If 
noticeable heat exchange occurs in the reactor, the energy balance equations 
must be considered as previously with heat exchange occurring at the wall. 



^ z -\- dz 



v{z) 



v{z + dz) 



z=0 C(z) z = L 

P{z) 

v{z) 



Figure 1.9: Schematics of a tubular reactor 

The ffow is assumed to be fully turbulent, which results in a ffat velocity profile 
and justifies the plug ffow assumption. For a plug ffow reactor, the reaction 
rate and all the process variables are constant over a given cross-section, i.e. 
the radial variations are discarded and only the axial variations are considered. 
The chemical reaction is identical to the previous cases: A ^ first-order, 
and the reaction rate ta is given by ta = ko exp(— F^/(i?T)) Ca- 

The conservation principles, in the case of distributed-parameter systems, 
are applied to an infinitesimal volume (control volume) in which the fiuid prop- 
erties may be assumed constant. The shape of the control volume depends 
on the geometry and the ffow conditions in the reactor. For example, for a 
plug ffow reactor, the control volume is a cylinder of thickness dz (example of 
Fig. 1.9), for a tubular reactor with both radial and axial variations in process 
variables, the control volume will be a ring with height dz and radial thickness 
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dr^ and for a tubular reactor with variations of process variables in radial, axial 
and angular directions, the control volume will be a sector of a ring. 

Consider a cylindrical control volume between z and z -\- dz ai time t. All 
the variables such as the density p, velocity v and concentration Ca depend on 
time t and axial dimension z. The mass balance is performed on the control 
volume with thickness dz, and cross-sectional area 5", giving 

v{z) S p{z) = v{z + dz) S p{z + dz) + dzp{z)) (1.39) 



which is simplihed to 



dp d{pv) 
dt dz 



(1.40) 



Diffusion is related to the axial concentration gradient in the reactor. 

The axial diffusive ffux of component A (moles per unit time and unit cross- 
sectional area) is expressed by Tick’s law 



Na 



Va 



dCA 

dz 



(1.41) 



where Va is the turbulent diffusion coefficient and the corresponding mass 
balance on A over the control volume is 



v{z)SCa{z)^SNa{z) 



which can be simplified as 



v{z + dz) S Ca(z + dz) + S Na{z + dz)-\- 



dt 



(1.42) 



OCa dvCA 
dt dz 



+ kCA 



_d_ 

dz 



Va 



dCA 

dz 



= 0 



(1.43) 



Such equations, however, are too complex for control applications. It is 
well known that a tubular reactor can be represented as a series of n perfectly 
mixed continuous stirred tank reactors, where n is large (infinite in theory). 
For control purposes, an approximate dynamic model is often sufficient. For 
example, a tubular reactor can be modelled as a cascade of a few (possibly 
three) perfectly mixed continuous stirred tank reactors, i.e. for i (1 < i < 3) 
the overall and component mass balances become 

/ 

Pi—l Z^i—1 — 

< 

Z^i—1 CA,i—l — 

\ 

Note that a rigorous simulation model will need a much larger number of 
elementary reactors in series. 



Vi Ca,i -\- k/ S Ca,i + Az 



dCA,. 

dt 



(1.44) 
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1.6.3 Degrees of Freedom 

A state-space model can represent either the steady state or the transient be- 
haviour of a system. The steady- state solution obtained by setting all time 
derivatives to zero constitutes the intialization of the dynamic regime. The 
following discussion on degrees of freedom could be applied to steady state, 
but is here devoted to control, thus to the dynamic model. 

The number of degrees of freedom ndf of a system is equal to the number 
of variables minus the number of equations 

ndf = number of var. — number of eq. 

If the degrees of freedom ndf is zero, the system is fully determined (or specified) 
and there exists only a unique solution; if it is positive, the system is under- 
specified and ndf equations should be added; if it is negative, the system is 
over-specified and ndf equations should be removed to get a unique solution. 

Each control loop adds an additional equation. Furthermore, the external 
disturbances are also specified to reduce the number of unknowns. 

1.7 Process Stability 

A process is said to be stable (asymptotically) when in response to a distur- 
bance, the state variables converge towards a steady state (the system is said to 
be feedback- negative). Another definition of stability is that a process is said 
to be stable if any bounded input results in a bounded output. If a bounded 
input results in an unbounded output, the process is unstable. 

The process is unstable when in response to a disturbance some state va- 
riables tend mathematically towards infinity (the system is feedback-positive). 
In practice, that means simply that the variables go out of the desired domain 
or do not tend to come back in a stable manner, but oppositely go far from it 
at least periodically. 

Nearly all chemical processes are stable in open loop. However, a CSTR with 
an exothermic reaction can be unstable. Indeed, if the cooling is insufficient 
with regard to the heat of reaction, there may be three stationary states, one 
stable at low temperature and low conversion, one stable at high temperature 
and high conversion, a third unstable state at an intermediate temperature 
and conversion (see Sect. 3.2.3). Nearly all processes can be made unstable 
in closed loop. A major recommendation for the controller design is to avoid 
instability. 



1.8 Order of a System 

If a process is described by an ordinary differential equation of order n, the 
process is said to be of order n 
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where f{t) represents an input or a forcing function. Note that an ordinary 
differential equation of order n is equivalent to a set of n first-order ordinary 
differential equations. 

The first-order process model 



f(t) =aox + ai 



dx 

dt 



can be written as 

fit) =ao(x + r— ) 

where r is the process time constant. 

The second-order process model 



dx 

f{t) = ao X + ai — + 



dt‘^ 



can be written as 

where ( is the damping coefficient. 



(1.46) 

(1.47) 



(1.48) 

(1.49) 



1.9 Laplace Transform 

The Laplace transform is an elegant mathematical method to solve linear or 
linearized differential equations. In control theory, it is used to develop simple 
continuous input-output models and thereby analyze the infiuence of external 
variables on a given process. 

The Laplace transform of a function f{t) is defined by 

pOO 

m{t)] = C(s) = / f{t) exp{-st)dt (1.50) 

Jo 

assuming that the function or signal f{t) is zero for t < 0. This is the mono- 
lateral Laplace transform which is used throughout this book. Notice that the 
exponential term has no dimension, therefore the dimension of variable s is the 
inverse of time (frequency). 

If function f{t) presents discontinuities at the boundaries, the Laplace trans- 
form is defined as 

C[f(t)] = F{s) = lim J f(t) exp{—st) dt (e ^ 0 , T ^ oo) (1-51) 

The bilateral Laplace transform for nonzero functions for negative t is de- 
fined as 

/ oo 

f{t) exp{-st)dt (1.52) 

-OO 

and is identical to the Fourier transform if we set s = j oo. The Laplace trans- 
form exists only if the integral (1.50) is bounded: the function f(t) exp(— st) is 
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summable in Lebesgue’s way. For example, consider the function f{t) = exp(t). 
This function is unbounded when t +oo. However, let us try to calculate its 
Laplace transform. We get 

POO 

^[/(O] = / exp(t) exp{—st) dt = 

Jo 

= Y ~ — if- > 1 

Its Laplace transform would be defined only in a frequency domain excluding 
low frequencies. Its convergence region is the domain of s complex values such 
that the Laplace transform exists, here the real part of s should be larger than 

1 . 

If we consider the step function f{t) = 1, if t > 0, f{t) = 0 else, its Laplace 
transform is 



5 — 1 






n.53^ 



mt)] = 

= - if: lZe{s) > 0 

Its convergence region is the complex right half-plane. 

The inverse Laplace transform is defined as 

-1 P(T+j OO 

f(t) = C~^[F{s)] = - — : / F{s) exp{ts)ds 
2 7TJ Ja-joo 

This integral is defined on a complex domain with s = a -\- j oo. 
The Laplace transformation is a linear mapping 



1 exp(— 5t) dt = 






+ 00 

0 



^aifi{t) +a 2 f 2 {t)] = ai£[fi{t)] +« 2 ^[/ 2 (^)] 



(1.54) 



(1.55) 



(1.56) 



To get the inverse Laplace transform of F{s), it is in general useful to expand 
the function F{s) which very often takes the form of a rational fraction, as a 
sum of simple rational fractions, and then to operate the inverse transformation 
on each fraction separately. 



1.9.1 Linearization and Deviation Variables 

What is a Nonlinear Model? 

Model linearization often poses problems for students. One reason may be that 
the analysis of the nonlinearity of the model is not clear. First, let us give some 
mathematical examples. 

Consider a function of a single variable f{x): 
f{x) = Ax and f{x) = 2x + 3 are linear with respect to x, 

/(x) =3x2, ^ ^ are nonlinear with respect to x. 

A function of a single variable is linear with respect to this variable when 
the derivative of this function is constant. Otherwise, it is nonlinear. 
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Consider a function of two variables f{x,y): 

/(x, y) = 2 ^ + 3 is linear with respect to y and independent of x, 

/(x, y) = 2 X 6y 5 is linear with respect to x and y, 

/(x, y) = xy is nonlinear with respect to x and y, 

/(x, y) = 2x^ -\-3y is nonlinear with respect to x and linear with respect to y, 
/(x, y) = 4 X + 2 + 3 ^ + 5 is nonlinear with respect to x and y. 

A function of several variables is linear with respect to one of its variables 
when the partial derivative of this function with respect to the considered 
variable is constant. Otherwise, it is nonlinear with respect to that variable. 



Significance of Linearization in Process Control 

Two cases can be considered: either a hxed set point is imposed on the process 
(case of regulation), or a reference trajectory is to be followed by the process 
(case of output tracking). Consider the simpler hrst case. We wish the process 
to be maintained in the neighbourhood of a set point, which will thus be the 
steady state. Due to imperfections of the control, the output and the state 
variables move around their steady-state values. The difference between the 
transient value of any variable and its steady-state value is called a deviation 
variable. When this latter remains small with respect to its absolute value, the 
behaviour of a function of this variable can be approximated by the tangent at 
the considered point (Fig. 1.10) 

/(x) « /(x®) + /'(x®) (x - X®) (1.57) 




Figure 1.10: Linear approximation of a nonlinear function by linearization of 
this function around steady state 
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General Linearization 

A nonlinear state-space model of a process must be linearized before apply- 
ing the Laplace transform. The system thus obtained is called a linear time- 
invariant (LTI) system. Consider again the example of the chemical reactor 
(Eq. 1.16); the term FCa^ which appears in the differential equation in the 
component mass balance, is nonlinear as F{t) and CA{t) both depend on time. 
As the Laplace transformation is a linear mapping, it is therefore necessary to 
linearize the balance ordinary differential equations around steady state (de- 
noted by ^); thus, the product F Ca becomes 

F{t) CA{t) =F^C%+ F* {CA{t) - CA + C% {F{t) - F*) + Q{e^) (1.58) 

The term O(e^) indicates that the Taylor series expansion in the neighbourhood 
of the steady state is truncated at the first order. Generally, for a function / of 
n variables xi, . . . , by neglecting the higher-order terms, the Taylor series 
expansion leads to 

/(xi,...,x„) w {xi-xD 5f Sxi 

(1.59) 

It can be noticed that the linearization results in deviation variables (with 
respect to the steady state or a reference state) of the form Sxi = (xi — xf) 
which play an important role in Laplace transformation. 

1.9.2 Some Important Properties of Laplace 
Transformation 

• The Laplace transformation is a linear operation 

+a 2 f 2 (t)] =aiC[fi(t)] +« 2 ^[/ 2 (^)] (1-60) 

• The Laplace transform of a first-order derivative of a function is 

C[^]=sF{s)-m (1.61) 

If f{t) is a deviation variable with respect to the initial steady state, its initial 
value becomes zero: /(O) = 0, and the previous equation simply becomes 

C[^]=sF{s) (1.62) 

This assumption is used in general. 

• The Laplace transform of the n^^-order derivative of a function is 

= s” m - /(O) - s”-' ... - (1.63) 
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If f{t) is a deviation variable, its initial value and successive derivatives up to 
the (n — l)-th-order become zero so that the previous formula becomes 

C[^^]=s^F{s) (1.64) 

• The Laplace transform of the integral of a function is 

C[ f f (x) dx] = - F (s) (1.65) 

Jo ^ 

• The initial value theorem is 



lim/(t) = lim sF{s) 

t^O s^oo 


(1.66) 


• The hnal value theorem is 




lim /(t) = lim sF{s) 

s^O 


(1.67) 



Notice that the hnal value theorem cannot be applied in the case of a function 
corresponding to an unstable system. For example, consider the Laplace trans- 
form F{s) = 1/(5 — 1), which would be the transform of function f{t) = exp(t) 
if we strictly apply Table 1.1. Let us try to apply the hnal value theorem 
(1.67) to this function. It gives 

lim f(t) = lim — - — = 0“ (1.68) 

t^+oo s^O S — 1 



One would wrongly conclude that the function f{t) = exp(t) tends towards 0 
when t + 00 . The mistakes comes from the fact that one does not take into 
account the remark concerning Eq. (1.53). 

Consider the Laplace transform F{s) = l/(5(5 + 1)) corresponding to a 
stable system (we will later see that it is a hrst-order system subjected to an 
input step). The hnal value theorem gives 



lim 

t^+oo 



f{t) = lim 



0 5(5 + 1) 



= 1 



• The Laplace transform of a delayed function is 



C[f{t-to)] = exp{-sto) £[f{t)] 



(1.69) 



(1.70) 



Note that the function f{t — to) is the same as f{t) delayed by to, which means 
that at time to, the delayed function is equal to /(O) (Fig. 1.11). The delay 
corresponds to a time translation of the function. 

• The complex translation is 

£[/(t) exp(at)] = F{s — a) (l-^^l) 



• The scale change is 



jC[f{t/a)] = aF{as) 



(1.72) 
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Figure 1.11: Illustration of the effect of a time delay of 20 time units on a 
response 



fit) 


git) 


= /(c * 






Fis) 




H{s) = F{s)G{s) 


G(s) 







Figure 1.12: Convolution for a linear system. Top: time domain. Bottom: 
frequency domain 



• Laplace transform of convolution 

When a signal f{t) excites a linear time-invariant system with impulse response 
g{t) (Fig. 1.12), the response of the system h{t) is equal to the convolution 
product of f{t) by g{t) denoted by 

/ oo 

f{T)g{t-T)dT (1.73) 

-OO 

The Laplace transform H (s) of the output is equal to 

= (1-74) 

which is expressed as the Laplace transform of a convolution product being 
equal to the product of the Laplace transforms of the functions. 

A consequence of this property is that when a signal f(t) excites two linear 
systems in series gi(t) and g 2 {t), the response h{t) is equal to 

Kt) = fit) * iaiit) * g2{t)) (1.75) 

and its Laplace transform H (s) is 

m(t) * (gi(t) * g2it))\ = C(s) Gi(s) G2 (s) 



(1.76) 
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Thus, the Laplace transform of the impulse response of two linear systems in 
series is equal to the product of the Laplace transforms of the individual impulse 
responses. 

• The complex convolution is 

1 /‘Cr+joo 

^f(t)9it)] = — / F{q)G{s-q)dq (1.77) 

Ja—joo 



• Parseval-Plancherel relation 

This relation, classical in signal processing, expresses that the energy of a signal 
is equal to the sum of the energies of its constitutive signals 








(1.78) 



It is necessary that the signal f(t) be square-integrable, which means that the 
integral of must exist. 

• Differentiation or integration with respect to a parameter 
Consider the function /(t, 0) which depends on parameter 6 as well as time t 
(both are independent variables) the Laplace transforms of the derivative or 
integral of the function with respect to 6 are 



39 






(1.79) 



and 




• Table 1.1 lists the Laplace transforms of some common functions. 



(1.80) 



1.9.3 Transfer Function 

Definition of a Transfer Function 



Consider a linear single variable system whose dynamic behaviour is described 
in terms of deviation variables by an ordinary differential equation of order n 
linking the input and the output 



bo u{t)-^bi 



du{t) 

dt 



+ . . .-\-bm 



d^u{t) 

dt^ 



aoy{t)^ai 



dy{t) 

dt 



+ . . .-\-an 



dt^ 



, m < n 



(1.81) 

where u{t) and y{t) are the system input and output respectively (Fig. 1.13). 
If we assume that the system is initially at steady state, the deviation variables 
and their successive derivatives are zero at initial state 



(5u(0) = 0, 



(5^(0) = 0, 



du 

dt 

dt 



= 0 ,. 

0 



= 0 , 



\dt^ 
( d^y^ 



= 0 

0 

= 0 



(1.82) 



0 



0 
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Table 1.1: Laplace transform of some common functions 



Signal f{t) {t > 0) 



Transform C[f(t)] = F(s) 



Convolution product /(t) * g{t) 



. dfjt) 
dt 



F{s)G{s) 



Derivative: 



sF{s)-f{0) 



Integral: f(x) dx 






Delayed function: /(t — to) 



exp(-gto) F{s) 



Dirac unit impulse: 6{t) 



1 



1 1 — exp(— s/r) 
T s 



Unit impulse of duration r dehned by 
(5r (t) = 0 if t < 0 or t > T 
(5r(t) = i if 0 < t < T 



~X~ 



Step of amplitude A 



Exponential: exp(— at) 



r exp(-i/r) 



T g + 1 



Ramp: at 



t exp (—at) 



T 



(s + a) 



t^ exp(— at) (n > 1) 



{s + g) 



n+l 



-CO 

2 

s T CO 



sin(o; t) 



s 

2 

s T CO 



cos{co t) 



CO cos{(f)) + s sin Wf 



sin(o; t + 0) 



2 , 2 
S -\- CO 
(jj 



(g T g) T eg 



exp(— at) sin(o;t) 



g + g 



exp(— at) cos(o;t) 



(g T g) T eg 



U 

n! 



(exp(-ai) - exp(-6i)) 



1 

n+l 



(exp(-i/Ti) -exp(-i/T 2 )) 



(g + a)(s + b) 



Tl - T2 



(ns + 1) (t 2 S + b) 



E n 

i=l 



exp(— g^ t) 
- ai) 



n(s+ 



^ exp(-Cc^t) sin(g;pt) 

with: cOp = CO (0 < |C| < 1) 

I 



g T 2 g T O’ 



■ (1 — exp(— g t)) 



g(g + g) 



The Laplace transformation of (1.81) gives 

(bo + 6i 5 + . . . + bjYi s^) U (s) = (ao + ui g + . . . + aji s^) Y (s) (1.83) 

The transfer function of the system results as the ratio of the Laplace trans- 
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(a) 


u{t) 


Process 


y(t) 




Input 


Output 



U{s) 



(b) 



G{s) 



Y{s) 



Figure 1.13: Block diagram of a process (a) in the time domain, (b) in the 
Laplace domain (transfer function) 



form of the output variable to the Laplace transform of the input variable, both 
expressed in terms of deviations from their steady states 



y js) _ Q/\ _ bo bi s bm 

U (5) Gq -\- CLl s Gfi 5 ^ 



(1.84) 



Thus, the transfer function is totally equivalent to the linear ordinary differen- 
tial equation describing the linearized system and can be further used to find 
output solutions to a given input. 

Most transfer functions take the previous form of a ratio of two polynomials, 
symbolized as 

GM = fg} (1.85) 

A transfer function is said to be proper if 

degree of N{s) < degree of D{s) 

it is strictly proper if 

degree of N{s) < degree of D{s) 

A transfer function is said to be biproper if 

degree of N{s) = degree of D{s) 

A transfer function is said to be improper if 

degree of N{s) > degree of D{s) 

It will be shown that improper transfer functions, such as an ideal derivative, 
amplify high-frequency noise. 

The following steps are followed to derive the transfer function of a 
process from a theoretical model: 

• Using the conservation principles, write the dynamic model describing the 
system, 

• Linearize equations using Taylor series expansion, 

• Express the equations in terms of deviation variables by subtracting the 
steady-state equations from the dynamic equations 

• Operate Laplace transformation on the linear or linearized equations, 

• Obtain the ratio of the Laplace transform of the output over the Laplace 
transform of the input. 
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Example 1.1: Application to the Surge Tank 

Equation (1.13) is linear with respect to all variables, and results simply in 

X(s) = ^C/(s)-^F(s) (1.86) 

which contains two transfer functions, both pure integrators. The process trans- 

fer function with respect to the input is 

G„(s) = ^ (1.87) 

S s 

and the load or disturbance transfer function with respect to the disturbance 
F is 

Gd{s) = ( 1 . 88 ) 

S s 

where S is the cross-sectional area of the surge tank and s is the Laplace 
operator. 

The block diagram (Fig. 1.14) represents the influence of the input and of 
the disturbance. 




Figure 1.14: Block diagram of the surge tank 



Example 1.2: Application to the Isothermal Chemical Reactor 

The system of Eqs. (1.27) is nonlinear with respect to the variables. Using the 
superscript “ 5 ” for the steady-state, linearizing the nonlinear terms, introducing 
the equations in terms of deviation variables and Laplace transforming results 
in 

sXi(s)= 1 [Fo(s)-F(s) + C/i(s)] 

sX 2 (s) = -^2^ [F^ - X^ 2 )Xl{s)] -kX 2 {s) 

[K - a:^)Co(s) + {U 2 {s) - X 2 {s)) - x* U,{s) - X 2 (s)] 

(1.89) 
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The steady- state equations resulting from Eq. (1.27) are 



0 = ^[F^-F^+u{] 

1 (1-90) 

0 = [Co® {u^2 - X 2 ) - ^2 wf ] -kx^ 

which yield 

uf = — Fq 

^ _ xl F^ + Skxf xl (1-91) 



Ui(s) 

U 2 {s) 




Xi{s) 

X 2 {s) 



Figure 1.15: Block diagram of the linearized isothermal chemical reactor tank 



The hnal system of transfer functions, the transfer function matrix, ex- 
presses the state vector with respect to the input vector and the disturbances 



Xi(s) 

W(s) 



i 

Js 



1 



Ss <S'(x|) 



h^F^^Skxl-F^)-£, 



+ 



+ 



Sxf 



s + 



■ + k 



s + 



1 






Fi 



1 Xn 

s s s{xiy 



Ssxl 



F® + Skxl - Co® 



s + 



' + k 



F{s) 



IL 

Sxl 

Tl 

S'xf 



Fo{s) 



Ui{s) 

U2{S) 



(1.92) 
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Note that the states x\ and X 2 are coupled by this system of equations. The 
first state x\ is influenced only by input ixi, while the second state X 2 is influ- 
enced by both inputs. With the system being multi-input multi-state, matrices 
of transfer functions relate the inputs and disturbances to the states. The li- 
nearized system described by the previous equation can be symbolized by the 
block diagram in Fig. 1.15. 

The non-isothermal chemical reactor could be treated in the same manner 
with added complexity. 



Impulse Response and Transfer Function of a System 

Consider a linear system in which the input and output are linked by the 
convolution product (Fig. 1.16) 

y{t) = u{t) * g{t) (1.93) 

If this system is excited by a Dirac impulse: u(t) = the output becomes 

y{t) = 9{t) (1-94) 

thus g(t) is the impulse response of the system and is equal to the inverse 
Laplace transform of the system transfer function as the Laplace transform of 
the convolution product is 

C[y{t)\ = C[u{t) * g{t)] = U{s) G{s) = G{s) (1.95) 

with U{s) = 1 for a unit impulse input. 



u{t) = S{t) 


9{t) 


y{t) = 9{t) 






U{s) = 1 




II 


G{s) 







Figure 1 . 16 : Time and frequency responses for a linear system subjected to 
a Dirac impulse input 

Although obtaining a system transfer function in this manner seems attractive, 
it suffers from several drawbacks: 

— A Dirac impulse is not realizable in practice, only an impulse of finite 
duration is possible. 

— A simple impulse input contains poor characteristics with respect to fre- 
quency excitation and introduces difficulty in process identification as will 
be shown later in this book. 
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In experimental characterization of the flow pattern in chemical processes, in 
particular reactors, however, the impulse response technique is used to get 
the residence time distribution. A tracer is injected over a short time at the 
process inlet and its evolution is recorded at the reactor outlet. The analysis of 
the output response is often complicated but it enables us to characterize the 
system as a combination of perfectly mixed CSTRs, tubular reactors placed in 
parallel or in series with possible bypass and/or dead volume (Levenspiel, 1999; 
Villermaux, 1982). 

Principle of superposition 

When the input can be decomposed into a sum of inputs (e.g. a rectangular 
pulse is the sum of a positive and a negative step, occurring at different times, 
see Figs. 1.17 and 1.18), the global output is the sum of the responses to the 
individual inputs 

U{s)=J2Ui{s) ^ Y{s) =G{s)U{s) =J2G{s)Ui{s) =J2Yi{s) (1.96) 

i i i 

The principle of superposition results directly from the properties of linearity 
of the Laplace transform. The principle of superposition is not valid for a 
nonlinear system. 

For example, a rectangular pulse can be seen as the sum of a positive and 
a negative step (in Fig. 1.17, us is the sum of ui and U 2 ). The respective 
responses to the three inputs for a given transfer function are given in Fig. 
1.18. 

Experimental Determination (Identification) of a System Transfer 
Function 

Consider a general system whose dynamics in the time domain can be described 
in terms of the following ordinary differential equation 

^ ,u{t) ,d) (1.97) 

where y is the output, u is the input, and d is the disturbance. To determine 
the system transfer function G{s), several possibilities exist depending on the 
choice of the applied input to the system: 

— For a Dirac impulse input, the output is equal to g{t), i.e. the inverse trans- 
form of the system transfer function G{s). The drawback of this technique 
is that the frequency content (the information content) of the input signal 
is not rich and results in poor identihcation. 

— For a step input or a succession of small amplitude positive and negative 
steps such as a pseudo-random binary sequence (PRBS), the information 
content of the input signal is adequate for yielding a satisfactory identihca- 
tion. This technique is particularly well adapted to discrete-time identih- 
cation. In the continuous-time domain, often a single step or a succession 
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Figure 1.17: Inputs from left to right: ixi, 1 x 2 , '^3 




Figure 1.18: Outputs from left to right: ^ 1 , ^ 2 , y?> as responses of the transfer 



function G{s) 



1 

35 + 1 



to respective inputs xxi, 1 x 2 , '^3 



of positive and negative steps is applied. System nonlinearities are demon- 
strated by dissimilar responses to positive and negative step changes in the 
input. For example, a furnace often shows a different time constant for a 
step increase or decrease in the heat rate. 

— For a sinusoidal input (Fig. 1.19), the identification is performed in the 
frequency domain. In this latter method, the Laplace variable s is replaced 
by 5 = j io (where co is the angular frequency in rad/s and is related to 
the frequency in Hz, u, hj co = 2 ir u). If a linear system is excited by a 
sinusoidal input u{t) = exp(jcjt), after a transient period, the output will 
also be a sinusoidal wave with the same frequency, a different amplitude 
and a phase difference, i.e. 

y{t) ~ G{j u ) exp(j cot) = G{s) exp(5t) for sufficiently large t (1.98) 

which, when combined with Eq. (1.97), gives 

sG{s) exp(st) = /(G(5) exp(st), exp(st), d) (1.99) 

Complex exponential functions (such as sines and cosines) are proper func- 
tions for the Laplace operator. In order to obtain a rich information for the 
experimental determination of the transfer functions, it is sufficient to vary 
the frequency co of the input signal over a large range. 
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u{t) = exp(jcjt) 

W) 



Process 

G{s) 



y(t) 



Figure 1.19: Experimental determination of a process transfer function by 
sinusoidal excitation 



Provided the function / is known, one analytically deduces from Eq. (1.99) the 
transfer function G{s) as the ratio of the output transform Y (s) over the input 
transform U(s). 



1.9.4 Poles and Zeros of a Transfer Function 



Consider a system described by the differential equation 



aoy(t)-hai 



dy{t) 

dt 



+ . . .-\-an 



dt^ 



bou{t)-^bi 



du{t) 

dt 



+ • • -Ybm 



d^u{t) 

dt^ 



(1.100) 



In the absence of an input excitation, the output Laplace transform is given by 



D(s)Y(s) = 0 



( 1 . 101 ) 



where D{s) = ao ai s an s'^ . The shape of the response curve depends 

on the roots of D{s) which are called the system modes. 

The system transfer function describes the response when the initial state 
is zero (refer to the relation between the transfer function and the state-space 
model) or when we refer to deviation variables. In general, the transfer function 
can be expressed as the ratio of two polynomials 



_ N(s) _ bo bi s bm 

^'■('5) — w; — r — 

D{s) Gq -\- ai s Gn 5 ^ 



with: n > m 



(1.102) 



where the denominator degree n is higher than or equal to the numerator degree 



The roots of the numerator polynomial N{s) are called the system zeros or 
zeros of the transfer function, since the transfer function is equal to zero for 
these values. 

If the numerator N{s) and the denominator D{s) have common roots, after 
cancellation of these common roots, the remaining roots of D{s) are referred 
to as the poles of the transfer function; they determine the response for zero 
initial state. 

If no common roots exist between N{s) and D{s), then the system can be 
completely characterized by its transfer function, and the sets of poles and the 
modes are the same. The transfer function G{s) becomes infinite when s is 
equal to one of the poles. 

If N{s) and D{s) have common factors, the latter form a polynomial noted 
R{s). The roots of R{s) are called system nodes, but are not poles for G{s). 
Eor that reason, they are called missing poles of the transfer function. In this 
case, the system is not completely characterized by its transfer function. 
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1.9.5 Qualitative Analysis of a System Response 

The response of a system to a given input u{t) can be determined from its 
Laplace transform 

r( 5 ) = G{s)U{s) (1.103) 

If the transfer function G{s) and the input transform U (s) are given, the system 
response y(t) can be obtained by inverse Laplace transformation of Y (s). 
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Figure 1.20: Different types of poles represented in the complex 5 -plane 



The behaviour of a system, such as its stability and the the shape of the system 
response, is largely determined by the poles of its transfer function. Let us 
consider a general transfer function with poles as depicted in Fig. 1.20 



C(5) 



D(s) 



N(s) 



(s - Si) (s - 82)^ (s - S3) (s - s^) (s - S4) (s - S5) (s - Se) (s - sg) (s - sr) (s - s^) 

(1.104) 

where 5i is a negative real pole 

52 is a negative real multiple pole of order m 

53 is a complex pole with negative real part 

53 is the conjugate complex pole {D being a real polynomial, all complex poles 
must appear as a pair in conjugate form) 

54 is a pole at the origin 

55 is a positive real pole 

56 is a complex pole with positive real part 
Sq is the complex conjugate pole of 5e 

57 is a pure imaginary pole 

57 is the conjugate of the imaginary pole 57 
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The transfer function G{s) can be expanded as a sum of rational fractions 



G{s) 



Cl 

S — Si 



+ 



C21 



+ 



C 22 



s S 2 (5 _ 52 ) 



2 + . . . + 



C2m 






+ — + 

^5 ^5 — 55 






+ 



C6 

5-56 



+ 



+ 



C7 

s — Sr 



+ 



C 3 

5-53 



+ 




* 

3 



(1.105) 

If poles are distinct, to find the coefficients in the numerators (residuals), it 
suffices to multiply G{s) by (5 — 5 ^) and set 5 = 5 ^ 



Ci = [(s-s*)C(s)]|^=, 



y(£) 

(s - Si) . . . (s - Sj_l)(s - Sj+l) . . . (s - s„) 



S = Si 



(1.106) 

In the case of multiple poles, e.g. if Si is a multiple pole of order m, the residuals 
are calculated by multiplying G{s) by (5 — Si)^ and differentiating successively 
with respect to 5 and setting 5 = 5 ^ after differentiation. Consider the following 
transfer function with a multiple pole Si of order m 












5 — 5i 



i-l) (s - Si)”* (s - Sj+i) . . . (s - s„) 

^i—1 I G,1 J C,2 I I 



H — 9i±l ^ . . . + 

I o c.,-, I ••• I 



I ^ I I I 

S-Si_l S-Si (s-Si )2 



(s - SiY 



(1.107) 

Let US define a new transfer function having the same poles as G{s) except for 
the multiple pole Si 

N{s) (5 - 5 ,)"^ 



Gt{s) = 



D{s) 



The residuals are calculated by the following expression 
1 d^^^Gt(s) 






j\ ds^ 



; j = 0, 1, . . .,m- 1 



(1.108) 



(1.109) 



Note that multiple poles are more frequently encountered in physical modelling 
than in identification. 

When a process having a transfer function G{s) of the form given by Eq. 

(1.107) is subjected to a unit impulse, the Laplace transform of the process 
output is equal to the process transfer function. To determine the time response 
to this impulse input, the inverse Laplace transformation is performed. 

• Simple real poles such as 5i and 55 result in exponential responses (Fig. 

1.21) 

y{t) = Ciexp(sit) (1.110) 

If the pole is negative real, the exponential tends towards 0: 5i is a stable 
pole. The closer the pole to the origin, the slower the response will be. If 
the pole is positive real such as 55 , the response increases exponentially with 
time and tends towards infinity: 55 is an unstable pole. 
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Complex pole with positive real part Complex pole with negative real part 



Figure 1.21: Impulse response of transfer functions with different types of 
poles 



• A multiple real pole with order i such as 52 results in the following responses: 

»(') = (uii) 

The response increases towards infinity if the pole is positive or zero, and 
decreases towards zero if the pole is negative. 

• A complex pole s is represented by its real part and its imaginary part: 
s = Sr ^ j si^ and the impulse response of a transfer function presenting a 
complex pole will be 

y{t) = exp(sij t) exp(j s/ t) = exp(sij t) [cos(s/ t) + j sin(s/ t)] (1-112) 

The imaginary part is responsible for the oscillatory behaviour while the 
stability depends only on the sign of the real part. 

Conjugate complex poles with negative real part such as 53 and 53 (Fig. 
1.21) result in damped oscillatory behaviour: they are stable poles. The 
closer the poles are to the imaginary axis, i.e. the nearest to 0 their real 
part is, the slower the decay of their exponential part. 

When poles are complex conjugate with positive real part such as 5 e and 
Sq (Fig. 1 . 21 ), they induce an oscillatory undamped response: they are 
unstable poles. 

• A pole at the origin such as 54 results in a constant term. 
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• Pure imaginary poles 5 = ± j CJ such as 57 and 57 result in a sinusoidal 
response with a frequency uo equal to the imaginary part of the poles. Un- 
der these conditions the system is said to be on the verge of stability or 
marginally stable. 



1.10 Linear Systems in State Space 



1.10.1 General Case 



Consider the single variable linear system (single-input single-output). Its 
state-space model is 



x{t) = Ax{t) -\- B u{t) 

y{t) = C x(t) D u(t) 



(1.113) 



where A, C, D are matrices of respective dimensions nxn,nxl, Ixn 
and 1 x 1 . The initial state of the system is a:^(0). For a system which can be 
represented by a strictly proper transfer function, i.e. whose numerator degree 
is strictly lower than the denominator degree, the matrix D is zero. This is the 
case for most physical systems. 

As this system is linear, it is possible to apply the Laplace transformation 



sX{s)-x{0) = AX{s)aBU{s) ^ {sI-A)X{s) =x{0)aBU{s) (1.114) 

where I is the identity matrix of dimension n x n. Provided that the matrix 
{s I — A) is invertible, the Laplace transform of the states can be obtained 

X(s) = {s I - A)-^ x(0) + {s I - A)-^ B U{s) (1.115) 



and the Laplace transform of the output is 

y(s) = CX{s) +DU{s) = C {s I - A)-^ x{0) +[C{sI - A)~^ B + D] U{s) 

(1.116) 

This response is composed of two terms, the hrst called response for a zero 
input and the second called response for a zero state. 

Given 

exp(At) = [{s I — A)~^] (1.117) 

the output response can be deduced from (1.116) 

y{t) = C exp{At) x{0) C exp{At) f e-xp{—AT)Bu{r)dT-\-Du{t) 

Jo 

(1.118) 

and the state of the process is given by 

x(t) = exp{At) x{0) e-xp(At) / exp{—Ar)Bu{r)dr 

Jo 



(1.119) 
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To obtain y{t) as in (1.118), it is possible to integrate the differential equa- 
tion of the system given by (1.113) by using the following properties of the 
matrix exponential 

^2 -j-Ti 

exp{At) = I + tA+ — + . . . + — A” + ... (1.120) 

2! n! 

^(exp(A t)) = A exp(A t) (1.121) 

The system transfer function is obtained, assuming that all initial conditions 
are zero, and consequently the initial states are zero: a:^(0) = 0. The output 
Laplace transform is 

Y{s)=[C{sI-A)-^B^D] U{s) (1.122) 

and the system transfer function is 

G{s) = C{sI -A)-^BaD (1.123) 

This transfer function can be written in a different form using the determinant 
“det” and the adjoint matrix “adj”: 

[adj(sJ-A)] B + H (1.124) 

The determinant of {si — A) is called the characteristic polynomial of A (di- 
mension n X n). Its n roots are the eigenvalues of A. 



1.10.2 Analog Representation 

First Case 



Consider a second-order system having the following transfer function 



Ci(5) 



K 

{ris + 1 )( t 25 + 1 ) 



(1.125) 



This transfer function is equivalent to the second-order ordinary differential 



equation 



nT2^ + {ti+T 2)^ + y = Ku(t) 



(1.126) 



which can be rewritten, in view of its further use in the analog block diagram, 
as 



fy 

dG 



aoy{t) - + bou{t) 



(1.127) 



with: 

ao = l/(riT2), ai = (ri +r2)/(riT2), bo = Kj{rxT^. 

Note that the above differential equation is in terms of the deviation va- 
riables in the same way as for the transfer function. Let us assume that at 
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Figure 1.22: Analog block diagram of the state-space representation 



time t = 0 the steady state prevails, y{t = 0) = ^(0) and {dy/dt)t=o = 

The analog circuit representation of the above ordinary differential equation 
is shown in Fig. 1.22. The input u{t) passes through a potentiometer (gain 
bo), a summator, an integrator and another summator (supplying the initial 
condition ^^(0)). The output from the second summator is passed through a 
second integrator and a third summator (supplying the initial condition ^(0)). 
The final output is y{t). 

Designating X 2 {t) as the output of the first integrator and xi{t) as the output 
of the second integrator, the above system can be represented by the following 
state-space model 



Xi(t) = X2{t) 

X2{t) = —aoXi{t) — aiX 2 {t) bou{t) ( 1 . 128 ) 

y{t) = xi{t) 



or, using matrix form 



Xi{t) 




_ X2{t) 





y{t) = [ 1 0 



1 



—ai 

Xi{t) 
_ X2{t) 



Xi{t) 

X2{t) 



+ 






( 1 . 129 ) 



where xi{t) and X 2 {t) are the system state variables. The number of state 
variables is equal to the order of the differential equation or the order of the 
transfer function. The state of the system at any time t depends only on the 
initial conditions at to and on the input u{t) between to and t. At initial time, 
the state variables are equal to the initial conditions and, in the following, they 
represent the evolution of the system. 



Second Case 

Consider a different second-order system having the following transfer function 

B{s) _ bos^ +bis + b2 

Ct2[S) — “TTW — o 

A(5) ao + ai s + a2 



( 1 . 130 ) 
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Process 



u(t) 1 


1 


z{t) 


B{s) 


1 y(t) 

1 ^ 


1 






1 ► 

1 



Figure 1.23: Representation of the partial state z{t) 



This transfer function is not strictly proper. It is equivalent to the second-order 
ordinary differential equation 

dy ^ , dPu , du , ,, , x 

^ ^ ^ = ^0 + bi — + 62 Su(t) (1.131) 

The transformation into an analog form is not as straightforward as in the 
first case. We introduce the partial state z{t) (Fig. 1.23) such that 

y(s) = S(s) Z(s) and Z{s) = ^—U{s) (1.132) 

A[s) 



As previously, we assume that u{t), z{t) and y{t) are deviation variables. From 
the previous equations, we deduce 



y{t) 

u{t) 



, d'^z ^ dz , , , 

d^z dz 

~ ^ ^ ^2 z{t) 



Set 

Xi{t) 

X2{t) 

The state representation follows 



dz 



dt 

z{t) 



(1.133) 



(1.134) 



Xi{t) 




— CLl — Oj2 


Xi{t) 


+ 




_ X2{t) 




ao ao 

_ 1 0 _ 


_ X2{t) 


1 



y(t) 



(,zi^ + h) 



- 6 o «2 , , X 
(— — + 62) 



ao ao 

which allows to easily obtain the analog representation. 



u{t) 

Xi{t) 

X2{t) 



+ — u{t) 

«0 

(1.135) 



1.11 Dynamic Behaviour of Simple Processes 

Initially, simple processes without a controller are considered (Fig. 1.24) and 
their open-loop behaviour is studied. 

Let us consider the response of the system to two types of inputs u(t): 
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U{s) 



G{s) 



Y{s) 



Figure 1.24: Open-loop block diagram of a process 



— A unit step: ix = 1 for t > 0 , ix = 0 for t < 0 ; the response of the system 
to this input is referred to as the step response. 

— A Dirac unit impulse: u = S (theoretical Dirac^); the response of the system 
to such an input is referred to as an impulse response. 

Let G{s) be the system transfer function subject to an input U{s). Except 
for possible time delays, G{s) is a rational fraction. For any physical input 
(impulse, step, ramp, sinusoidal, ...), U{s) can also be expressed as a rational 
fraction, therefore 



G{s) 



Dg{s) 



U{s) 



Nujs) 

Du{s) 



(1.136) 



The Laplace transform Y (s) of the output can be decomposed into 



Y(s) = G(s)U(s) 



^g(^) ^u{s) 



Nijs) N^js) 
D,{s) ^ Du{s) 



YnYYf 



Response = Natural response + Forced response 

(1.137) 

provided that the product (G{s) U (s)) is strictly proper and that denominators 
Dg{s) and the Du{s) have no common roots. 

The response yn{t) depends on the modes of G{s) and is called the natural 
response of the system, while yf{t) depends on the modes of U{s) (linked to 
the input type) and is referred to as the forced response of the system. 



^The Dirac function 6{t) is defined by physicists as 

6{t) = 0 Vt / 0 
(5(0) = +00 

which is the limit of a real pulse function centred around 0 with unit area (strength or energy) 
and zero duration. 

Mathematicians define the Dirac distribution such that the convolution of a function f{x) 
by the Dirac distribution is equal to 



f(x)*S(x) = f(x) 



An important property of a Dirac distribution is 




f(t)S(t-to) = f(to) 



The Dirac distribution is equal to the derivative of a unit step function (Heaviside function). 
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1.11.1 First-Order Systems 

A first-order system is described by a first-order differential equation of the 
form 

+y{t) = Kpu{t) (1.138) 

The corresponding transfer function is equal to 

G(.) = ^ (1.139) 

where r is the time constant and Kp is the steady-state gain, or asymptotic 
gain of the process. 

A first-order system can be represented by the block diagram shown in Fig. 
1.25. 




Figure 1.25: Block diagram of a first-order system 

If the input u of the process is a step function with amplitude A, the Laplace 
transform of the input is 

U{s) = 4 (1.140) 

Using the definition of the transfer function, the Laplace transform of the out- 
put is obtained 

y(s) = G(s) U{s) = 4 = _ 4^ = Yf{s) + Yn{s) (1.141) 

r s 1 s s r s I 

and the time domain response (Fig. 1.26) is 

y{t) = AKp{l -exp{-t/T)) (1.142) 

The forced and natural parts of the response are respectively equal to 

yf{t) = AKp ; y„(t) = -AKp exp{-t/r)) (1.143) 

With respect to the input of amplitude A, the asymptotic output (when t ^ oo) 
is thus multiplied by the gain of the process Kp. A first-order process is also 
called a “first-order lag”. 

The time constant r corresponds to the time necessary for the system res- 
ponse to reach 63.2% of its asymptotic value for a step input. After 2r, the 
response reaches 86.5% and after 5r it reaches 99.3% (Table 1.2). 

Several real physical systems have first-order dynamics. Examples of such 
systems are: 

— Systems storing mass, energy or momentum, 

— Systems showing resistance to the flow of mass, energy or momentum. 
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Figure 1.26: Response of a first-order system {Kp = l,r = 2) to a unit step 
function 

Table 1.2: Response of a first-order system to a unit step function expressed 
in percentage of the asymptotic value 



Time 


Percentage of the 
asymptotic value 


0 


0 


T 


63.21% 


2r 


86.47% 


3r 


95.02% 


4r 


98.17% 


5r 


99.33% 


6r 


99.75% 


7r 


99.91% 



1.11.2 Integrating Systems 

Integrating or pure capacitive processes are those whose dynamics only contain 
the hrst-order derivative of y{t) 

^ = K,u{t) (1.144) 

The corresponding transfer function is 

G(s) = ^ (1.145) 

The Laplace transform of the output of such a system to a step function with 
magnitude A is 

V(s) = (1.146) 

The time domain response y{t) (Fig. 1.27) is thus equal to 

y(t) = A Kpt 



(1.147) 
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The process is referred to as a “pure capacitive” or a “pure integrator”. The term 
“capacitive” signifies the accumulation of electrical charges, energy or mass. A 
surge tank can behave as a pure capacitive process. 




Figure 1.27: Response of a pure capacitive system {Kp = 1) to a unit step 
input 



1.11.3 Second-Order Systems 

A second-order system is described by a second-order differential equation writ- 
ten in the classical form as 

(1.148) 

with the corresponding transfer function 



G{s) 



Kp 



(1.149) 



where r is the natural period of oscillation of the system which determines the 
stabilization time of the system, ( is the damping coefficient and Kp is the 
steady- state gain of the system. 

The notions of natural period of oscillation and of damping factor are related to 
the damped or undamped oscillators. For C = 0? the expression (1.156) shows 
that the response to a step input oscillates continuously with a frequency 1 /r 
in radians/time unit. 

The transfer function of a second-order system is sometimes written as 



G{s) 



Kp 



5^ + 2 ^ UJfi s -\- uj‘^ 



(1.150) 



where = 1/r is the natural undamped frequency and cr = is the 

damping parameter. 

Several real physical processes exhibit second-order dynamics, among them 

are: 

— Two first-order systems in series. 
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— Intrinsic second-order systems, e.g. mechanical systems having an accele- 
ration. 

— Feedback or closed-loop transfer function of a hrst-order process with a PI 
controller. 

Note that the transfer function G{s) dehned by Eq. (1.149) has two poles, 
roots of: 5 ^ + 2 r 5 + 1 = 0, which are equal to 

N-C±yc2-1) if:C>l 

< Sj, — ^ 

- (-C \/l - C^) = ^n(-C - O = if: 0 < C < 1 

(1.151) 

If the natural period of oscillation r is hxed, then the position of the poles 
depends only on the damping coefficient The shape of the open-loop response 
to a given input is determined by the location of these poles on the 5-plane. 
For 0 < C < 1, the natural frequency is equal to the distance of the poles 
from the origin, the damped frequency uOa is equal to the distance of the poles 
from the real axis, and the damping parameter a is equal to the distance of the 
poles from the imaginary axis. 




Figure 1 . 28 : Normalized response of a second-order system to a unit step func- 
tion for different values of the damping coefficient ( (= 0.25; 1 ; 1.3 resulting 
in oscillatory underdamped response to overdamped response) {Kp = l,r = 1) 



If the input is a step function with magnitude A, the output Laplace transform 
is equal to 



Y{s) = A 



Kp 

s (r^ 5 ^ + 2 (“r 5 + 1) 



(1.152) 



which can be decomposed into 



y(s) 



AKp AKpT^s + 2(T 

s t'^s'^ + 2(ts + 1 



V(s) + y„(s) 



(1.153) 



The overall response consists of the forced and the natural responses 



y{t) = yf{t) + yn{t) 



(1.154) 
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The forced response is equal to 



~ AKp 



(1.155) 



and the overall response is 



y{t) 



AKp |l -exp(-CCT) 

if: 0 < C < 1 

AKp [1 - (1 + ^)exp( 

if: C = 1 

AKp |l -exp(-CCT) 
if: 1< C 




(1.156) 

The forced response is constant and equal to AKp while the natural response 
tends towards 0 when t ^ oo. The natural response takes into account the 
natural modes of the system and thus depends on the value of ( (Fig. 1.28): 



Rising time 




Figure 1.29: Response of a second-order system to a unit step input 

— For (“ > 1, there will be two real and distinct poles. The response is 
overdamped (multi-capacitive systems) with no overshoot. 

— For C = 1? there will be one multiple second-order pole. The response is 
critically damped, which corresponds to the faster overdamped response. 

— For 0 < C < 1, there will be two complex conjugate poles with negative 
real part. The response is underdamped. This response is initially faster 
than the critically damped and overdamped responses, which are sluggish; 
the drawback is the resulting overshoot. 
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With reference to the underdamped response of Fig. 1.29, the following terms 
are dehned: 

— Overshoot 

— 7T ( 

overshoot = — = exp( — , ) (1.157) 

B ^ ^ 

— Decay ratio = C/A^ = (overshoot)^ 

— Natural period of oscillation is dehned for a system with a damping coeffi- 
cient ( equal to zero. Such a system oscillates continuously with the natural 
period = 2 irr = 2ixjujn and the undamped natural frequency 

— Actual period of oscillation Ta^ which is the time between two successive 
peaks, characterized by its damped frequency uOa 



UJr 



= U>n a /1 - = 






2 7T 



(1.158) 



— Rise time: this is the time necessary to reach the asymptotic value for the 
hrst time 

1 



im. — 






yi-c 



: arctg — 



c 



(1.159) 



It can also be dehned as the time necessary to go from 10% to 90% of the 
asymptotic value and, in that case, it can be approximated (Goodwin and 
Sin, 1984) by 

(1.160) 



UJn 

— First peak reach time: the time necessary for the response to reach the hrst 
peak 

7T 7T 

^ (1.161) 

W„a/1 - 

— Settling time: time necessary for the response to remain in an interval 
between ±e (±5%, or ±2%) of the asymptotic value. For ±e = ±1%, 
according to Goodwin and Sin (1984), the settling time is 



7T 

tp — — 



4.6 

G - 7 — 

For (0 < C < 1), the time domain response can be written as 



(1.162) 



y{t) = AKp — AKp — exp(— crt) sm{iOdt + 0) 



(1.163) 



with 



6 = arccosC = arctan(i 






) = arcsin(\/l — (“^) 



(1.164) 



and the envelope of the undamped sinusoidal response is 



exp(— cr t) sm{iOd t 0) 



(1.165) 
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Figure 1.30: Normalized amplitude ratio for a sinusoidal input with varying 
damping coefficient ( between 0 and 1 per increment of 0.1 



The time domain response of a second-order system subjected to a sinusoidal 
input: u{t) = A sin(cjt), after the transient response decays, will take the form 



yoo{t) = 



Kp A 



sin(cjt — arctang 



and the normalized amplitude ratio is equal to 

EA„ = ^ 



1 — (ujt)^ 



which is maximum at a frequency cu^nax given by 



) (1.166) 



(1.167) 



Wmax = a/(1 - 2(^)/t 



(1.168) 



The normalized amplitude ratio has a maximum equal to l/(2(“y/l — for 
0 < C < 0.707. This maximum increases very quickly when ( becomes small 
(Fig. 1.30). 

Large oscillations are not desired, therefore small damping coefficients ( 
must be avoided. In controlled processes, a damping coefficient around ( = 1 
(conservative) or (“ = 0.7 (low overshoot, fast response), is often recommended. 



1.11.4 Higher-Order Systems 

Three types of higher-order systems will be described: 

— n First-order processes in series (multi-capacitive). 

— Processes with time delay. 

— Processes with inverse response. 
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n First-Order Processes in Series 

The transfer function of n first-order processes in series is obtained by multi- 
plying the transfer functions of n hrst-order systems 



C(s) — P 

i=l 



Kpi 
TiS + 1 



(1.169) 



Example 1.3: Tubular Plug Flow Reactor 

Consider a tubular plug flow reactor with a mean residence time equal to ri. 
In the absence of reaction, for a simple flow, the outlet concentration is equal 
to the inlet one, just delayed by the residence time ri. Suppose now that a 
hrst-order reaction A ^ B with a reaction rate va = k Ca is carried out in 
the reactor, the reactor being fed with a reactant stream at inlet concentration 
Cao' This represents a distributed-parameter system whose model is given by 
a partial differential equation. Another approach to model the reactor is to 
discretize it into n elementary reactors (Fig. 1.31) with a residence time given 
by 

r„ = ^ (1.170) 

The component mass balance on each element is given by 

= - [CA{i - 1) - CaA] -kCAii) (1.171) 

dt Tfi 



Cao 



Cau 



Cao 



Ca{i- 



cb 




Figure 1.31: Decomposition of a tubular reactor into a series of n continuous 
perfectly stirred tank reactors 



In the absence of reaction, the how is simply represented by n hrst-order sys- 
tems in series with unit gain 



^a(0 _ 1 

CA{i-l) TnS^l 



(1.172) 



The observed response at the reactor exit is similar to an overdamped system 
with a sluggish sigmoidal shape. Figure 1.32 represents the response of the 
system to a step increase in Caq from 0.4 to 0.5, using different numbers of 
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discretization elements n ranging from 1 to 100. Note that as n approaches 
100, the response looks like the input step with a delay time equal to the 
mean residence time of the tubular plug flow reactor (assumed 6 min. in this 
simulation). 




Figure 1.32: Response of the discretized plug flow reactor to a step increase 
in the inlet concentration from 0.4 to 0.5 with the number n of discretized 
elements, ranging from 1, 2, 3, 4, 5, 10 and 100 (in the absence of reaction) 




Figure 1.33: Response of the discretized plug flow reactor to a step increase in 
the inlet reactant concentration from 0.4 to 0.5 in the presence of a first-order 
reaction 



In the case where a chemical reaction occurs, each element represents a 
CSTR. Except in the simple case where the reaction is first-order, the model is 
in general nonlinear and would need a linearization around a steady state. For 
a first-order reaction, the transfer function is 



gA(0 ^ 1 

Ca(^-I) TnS^l^kTn 



(1.173) 



Note that the process gain is no longer unity due to the depletion of reactant by 
chemical reaction. The process response to a unit increase in the inlet concern 
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tration of reactant Caq from 0.4 to 0.5 in the presence of reaction is shown in 
Fig. 1.33. Due to the chemical reaction, the asymptotic outlet concentrations 
decrease with the number of elements of discretization in a similar manner to 
a tubular reactor. In comparison with Fig. 1.32, the response is more sluggish. 



Another example of a series of elementary systems resulting in an overall higher 
order system is the staged processes such as a tray distillation column. Each 
tray of a distillation column can be often considered as a hrst-order interacting 
process. 

A packed column (distillation, absorption or chromatography) can be mo- 
delled by partial differential equations and could be discretized in a similar 
manner to a tubular reactor. 

The dynamics of distributed-parameter systems and higher-order systems 
are often approximated by a first- or second-order overdamped model with time 
delay. For example, in a distillation column, a set of plates such as the stripping 
or the enrichment section is often identified as first-order with delay. 

Processes with Time Delay 

Time delay may be an inherent dynamic characteristic of a process or due to 
the measurement. In the former case, the process input does not immediately 
affect the process output. In the latter case, the measured signal received by the 
controller does not correspond to the contemporary process information and 
suffers from delay. A common example of time delay is the transportation lag, 
which may be either due to the process or measurement or both. Consider, for 
example, the concentration measurement in a reactor which frequently is not 
done in situ (Fig. 1.34). The measuring device is mounted on a sampling loop. 
The sample is pumped through the loop and experiences some transportation 
lag td to reach the sensor. 



1 






[> 




1 



(a) 




(b) 



Figure 1.34: Case (a): In situ sensor. Case (b): sensor placed in the exit 
pipe, inducing a transportation lag 

In the case of a distillation column, in general distillate and bottom concentra- 
tions are controlled by manipulating, for example, the refiux fiow and the steam 
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flow to the reboiler. Measurements are typically the levels in the reboiler and 
in the condenser, temperatures at different points of the column, and distillate 
and bottom concentrations. Temperature and level measurements can be con- 
sidered as instantaneous. This is not the case for concentration measurement, 
e.g. in the case of refineries, a chromatograph some distance away from the 
distillation column is often used. In such cases, the delay time will consist of 
a transportation lag to pump the sample from the process to the analyzer and 
an additional time for the analysis of the sample, which in the case of a chro- 
matograph could be in the order of several tens of seconds. A sample is taken 
from the distillation column at time ti and the result is available at time ^ 2 , 
where t 2 —t\ =td- During the measurement, the process continues to evolve. 




Figure 1.35: Representation of a system with time delay 

The time delay poses a problem in process control. The time delay between 
an input and an output (Fig. 1.35) means that the input variations have no 
immediate influence on the output. 

For a first-order system with a time delay, the transfer function linking the 
input u(t) and the delayed output y(t — td) is 

- id)] ^ Kp exp(-trfs) 

C[u{t)] r s -\- 1 

The exponential term is a nonlinear term. It is often approximated, for exam- 
ple, by a Fade approximation (here a first-order approximation), which converts 
the delay term to a rational fraction 



exp(-t^ s) 



1 td 

1 I 

l+2« 



{1.175) 



A more accurate approximation of the time delay is realized by the second- 
order Fade approximation. 



exp{-td s) 



1 , 
1 - — s + 



12 



1 



(1.176) 



Figure 1.36 shows the unit step response of a first-order system without time 
delay y{t), with exact time delay y{t — td), with first-order Fade approximation 
yi and with second-order Fade approximation ^ 2 - Note that the approximations 
are valid for times much larger than the time delay. Initially both first- and 
second-order approximations exhibit inverse response due to the zeros of the 
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Time 



Figure 1.36: Response of a first-order system {Kp = 2,r = 5) with a time 
delay of 20 s to a step input: y{t) (without time delay), y{t — td) (with time 
delay), yi (Fade hrst-order), y 2 (Fade second-order) 



rational transfer functions introduced by Fade approximations. The number of 
intersections of the approximated response with the time axis is equal to the 
order of Fade approximation and corresponds to the number of positive real 
zeros or complex zeros with positive real part of the transfer function, i.e. for 
the hrst-order Fade approximation, there is one real positive zero and for the 
second-order Fade approximation, there are two conjugate complex zeros with 
positive real part. 

There exists no perfect approximation for the time delay. However, digital 
computers handle time delays with relative ease, in particular in the case of 
digital control. 

Processes with Inverse Response 



Frocess 1 





Ki 




U{s) 




TiS + 1 


+1 




c 






K2 


-t 




T2S + 1 





y(s) 



Frocess 2 



Figure 1.37: Representation of a system with inverse response 

In a system with inverse response such as that represented in Fig. 1.37, the 
overall response yg (Fig. 1.38) initially moves in a direction opposite to its 
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final direction. This behaviour is caused by the addition of two opposing sub- 
processes. At the beginning, process 2, which has a smaller time constant, 
is dominant. Subsequently, process 1, which has a higher gain (iCi > 7^2 ), 
becomes dominant. The overall process is thus composed of two competitive 
processes having different time constants and different gains. 

To produce an inverse response, it is necessary and sufficient that the plant 
transfer function exhibits a positive real zero or a complex zero with a positive 
real part. For the example shown in Fig. 1.37, the overall transfer function will 
have a positive zero if 



T2 Ti 



(1.177) 




Time 



Figure 1.38: Inverse response of the system shown in Fig. 1.37 to a unit step 
input (iCi = 5, Ti = 20, K 2 = 1.5, T 2 = 2) 

Chemical processes which exhibit an inverse response are not rare. For example, 
an increase in the vapour boil-off (increase in the steam flow rate to the reboiler) 
in a distillation column results in an inverse response in the liquid level in the 
bottom of the column. The increase in the vapour boil-off, initially, decreases 
the liquid level in the bottom of the column. Subsequently, due to the increased 
vapour flow rate and frothing on the trays immediately above the reboiler, the 
liquid level may eventually increase. 

Another example is the response in the liquid level in a boiler due to an 
increase in the inlet water flow rate. The initial increase in the water flow 
rate decreases the liquid level because of the collapse of the vapour bubbles. 
However, if the steam production is maintained at a constant rate, the liquid 
level will eventually increase. 

A third example is the response in the exit temperature of a fixed-bed re- 
actor with exothermic reactions to a step increase in the feed temperature. If 
the feed temperature increases, the rate of reactions at the reactor inlet will 
increase and the reactants are consumed close to the reactor inlet. This will 
move the hot spot closer to the reactor entrance and consequently the exit re- 
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actor temperature decreases. However, eventually the reactor exit temperature 
will increase as a result of the increase in the inlet temperature. 

1.11.5 Process Identification in the Continuous Domain 

General Principles 

It is possible to develop a model for an existing process using only the input- 
output data. This empirical technique for developing process models is referred 
to as process or system identihcation (Sinha and Rao, 1991; Unbehauen and 
Rao, 1987; Walter and Pronzato, 1997; Young, 1981). System identihcation for 
complex processes requires less engineering effort compared to the theoretical 
model development. Of course, the application field of such an identified model 
is more limited. In open-loop system identification, the controller is switched 
to manual, the controller output is changed by a step function or a series 
of pseudo-random-binary-signal (PRBS) or any other exciting input sequence, 
and the process response is monitored. The input-output data are used to 
develop the system model. This model will represent the dynamics of the 
combination of the final control element (the control valve), the process, and 
the measuring element. Measurement noise will generally be superposed on 
the actual process response. Sophisticated process identification techniques are 
capable of distinguishing the process model from the noise model. 

Often, a low-order linear model based on the a priori information of the 
process is assumed, i.e. the structure of the model is fixed. Under these con- 
ditions, process identification is reduced to the determination of the unknown 
model parameters, i.e. a parameter estimation problem. Let us assume that 
the process model is given by 



y{t)=f{t,0) (1.178) 

where 0 is the unknown parameter vector. In order to determine the unknown 
parameter vector, one may use the least-squares technique by minimizing the 
following objective function, which is the sum of the squares of errors between 
the measured output yi at time U and the output predicted by the model yi 

J{e)=Y,{yi-yif (1.179) 

i 

Other criteria can be introduced in particular by the use of weighting factors. 

Two main cases are distinguished depending on whether the model is linear 
or nonlinear. In the case of a linear model with respect to m parameters Oj, 
the model can be written as 



y(®) = ( 1 - 180 ) 

J = 1 

A set of n input-output observations can be collected in a matrix ^ such that 
^ij = (j)j{xi) ; 1 < i < n ; I < j < m (1.181) 
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The parameter vector estimated by minimization of the criterion (1.179) is 
equal to 

0 = (1.182) 

where y is the measured vector. 

Example 1.4: Identification of a Linear Model 

Consider the example of determining the parameters in the model of the heat 
capacity of a fluid expressed with respect to temperature by 

Cp = a + hT + cT^ (1.183) 



This model is nonlinear with respect to temperature but linear with respect 
to the coefficients a, 5, c. As we wish to determine the coefficients, we must 
realize at least three experiments. In the case where only three experiments are 
performed, the coefficients are the solution of a perfectly determined system 
which can be solved by the Gauss method, for example. In the case where more 
than three experiments are performed, it becomes a least-squares problem. 
Assume that we perform n > 3 experiments which give heat capacities Cp^i at 
temperatures Ti. The matrix and vector y of Eq. (1.182) are equal to 





" 1 


Ti 


T2 - 




■ Cp,i ■ 


4 > = 


1 


T2 


T| 


; y = 


Cp,2 




_ 1 


Tn 


rji2 




Cp^n 



The vector of estimated parameters is then 



d 

b 



c 






(1.185) 



In the case of a nonlinear model, which is most common in practice, more 
general optimization methods such as direct search or gradient-type methods 
such as the generalized reduced gradient method or the quasi-Newton method 
(Fletcher, 1991; Gill et ah, 1981), must be employed. Powerful algorithms 
exist for solving such problems as the BEGS method (Byrd et ah, 1995; Zhu 
et ah, 1994) or sequential quadratic programming (SQP) under NLPQL form 
(Schittkowski, 1985). It is also possible to linearize the non-linear model with 
respect to the parameter vector and then use a linear parameter estimation 
method. 

Example 1.5: Identification of a Nonlinear Model 

Gonsider the example of determining the parameters in the model of the sat- 
urated vapour pressure of a fluid expressed with respect to temperature by 
Antoine’s law 

B 



In(Psat) = A 



C + T 



(1.186) 
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We can make remarks similar to the previous case. This model is nonlinear with 
respect to the temperature and nonlinear with respect to the coefficients A, 

C . In the case where only three experiments are performed, the coefficients are 
the solution of a perfectly determined system which is now nonlinear and could 
be solved by the Newton-Raphson method (Carnahan et ah, 1969). In the case 
where more than three experiments are performed, it becomes a least-squares 
problem. Assuming that we make n experiments which give vapour pressures 
Pi at temperatures Ti^ a least-squares criterion to be minimized is written as 



n 

j = J2iHPi)-A + 

i=l 



B 

CPT, 






(1.187) 



whose minimization with respect to parameters can be performed by a quasi- 
Newton method. 



A First-Order Model 

If a time delay is present, hrst it must be estimated and then the system is 
examined without this delay. The steady-state gain Kp is also estimated from 
the asymptotic response. The time constant r can be evaluated by using Table 
1.2, for example, by searching the time at which 63.2% of the asymptotic value 
is reached. For a step change with magnitude A in the input, the time domain 
response is given by 

ln(l-^) = -l (1.IS8) 

which corresponds to a straight line with a slope — 1/r. 



A Second-Order Model 



Similar to a hrst-order model, the time delay must be estimated hrst, then the 
steady- state gain Kp. 

If the second-order system is overdamped, its transfer function can be writ- 
ten as 



G{s) 



Kp 

(ri5 + l)(T2 5 + l) 



(1.189) 



with two unknown time constants. Harriott’s method offers a simple graphi- 
cal method for the determination of the unknown time constants based on the 
measurements of process outputs at two points during its evolution. A more ac- 
curate method is the numerical non-linear optimization using the entire process 
output response to a step change in the input 



y{t) = A Kp 



1 + 



n 

T 2 -Ti 



-exp(-t/ri) + 



T2 
Tl - T2 



-exp(-t/T 2 ) 



(1.190) 
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When plotted versus time, this function has a sigmoidal shape with an inflection 
point at 

t= ln(— ) (1.191) 

Tl -T2 T2 

If the second-order system is underdamped, its transfer function is 



G{s) 



Kp 

r‘^ s‘^ -\- 2 (rs -\- 1 



(1.192) 



Two parameters must be estimated, the damping coefficient and the time con- 
stant r. Graphical methods can be used to determine the decay ratio or the 
overshoot to estimate the unknown parameters. 

However, a better approach is the nonlinear optimization using the res- 
ponse given by Eq. (1.156) to minimize a criterion such as the one given in 
Eq. (1.179). Consider the response of the model denoted by ymod,i and the ex- 
perimental response denoted by tjexpy at n different instants ti. The nonlinear 
optimization problem is expressed as 



min 

r,C,^P 



n 

exp,i ymod,i) 

i=l 



(1.193) 



which again can be solved by a quasi-Newton-type method. The response of 
the model depends analytically on the parameters and the criterion can be 
analytically differentiated. If this differentiation seems too difficult, it may be 
performed numerically. 



The Method of Moments 



The advantage of the method of moments is that it can be used with any type 
of input. 

This method is based on the definition of the Laplace transform of the 
impulse response g{t) of a system, which is its transfer function 



(1.194) 



pOO 

G{s) = / exp{—st)g{t)dt 
Jo 

As the n-th-order moment of a function f{x) is defined by 

pOO 

Mn{f) = / x^f{x)dx (1.195) 

Jo 

it can be noticed that the first two derivatives of G{s) with respect to s 

pOO pOO 

G\s) = — t e-x.p{—st)g{t)dt ; G'\s) = / e-x.p{—st)g{t)dt 

Jo Jo 

(1.196) 

are related to the moments of the impulse response function by 



pOO pOO pOO 

^(0) = / g{t)dt ; G'(0) = — / tg{t)dt ; G"{0) = / g{t) dt 

Jo Jo Jo 

(1.197) 
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Thus (T(0), — (T'(0), (T"(0) are respectively the zero-, hrst- and second-order 
moments of the impulse response g(t). 

Note that the above three integrals can be calculated by using the measured 
output response. Consider the following systems for the application of this 
method: 

• A hrst-order model with time delay 



G{s) 



Kp exp{-td s) 
r 5 + 1 



(1.198) 



giving 

G{0) = Kp ; G\0) = -Kp{T+td) ; G"{0) = Kp {2 t^ + 2rtd + 4) 

(1.199) 

from which the three unknown parameters Kp, t and td can be determined. 

• An overdamped second-order model with time delay (previously determined) 



G{s) — 



Kp exp(-trf s) 

(ti S 1) (t2 S + 1) 



from which 



G(0) =Kp ; G\0) = -Kp{Ti+T2+td) 

G"{Q) = Kp [(ti + T2 + td4 + '^1 + '^1] 



( 1 . 200 ) 



( 1 . 201 ) 



The time delay can be obtained by inspection of the response curve and the 
other parameters are obtained from the moments. 

• An underdamped second-order model with time delay (previously deter- 
mined) 

Kp exp(-td s) 



G{s) — 



s“^ -|-2((ts-|-1 

from which 

G(0) =Kp ; G'{0) = -Kp{2CT + td) 

G"(0) =Kp[4 + 4CTtd-2T^+8eT4 

Note that in this method, it is possible to use any type of input. We have, 
in general 



( 1 . 202 ) 

(1.203) 



T(s) = G(s) U{s) ; Y'{s) = G'(s) U{s) + G(s) U'{s) 

Y"{s) = G"(s) U{s) + G(s) U"{s) + 2 G'(s) U'{s) 

from which the following equations are deduced 

T(0) = G(0) C/(0) ; T'(0) = G'(0) C/(0) -f G(0) C/'(0) 

T"(0) = G"(0) C/(0) -f G(0) C/"(0) -f 2 G'(0) C/'(0) 

These quantities can be calculated by the following equations: 

pOO pOO pC 

U{0)= / u{t)dt ; u\0) = - tu{t)dt ; C/"(0) = 

Jo Jo Jo 



(1.204) 



(1.205) 



uit) dt 
(1.206) 
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and 

pOO pOO pOO 

^(0) = / y{t) dt ; F'(0) = — / ty{t)dt ; F"(0) = / y{t) dt 

Jo Jo Jo 

(1.207) 

In order to determine the characteristic parameters of the system, the mo- 
ments method uses the entire input and output curves and can be applied to 
any type of the input signal. This method should be preferred to any method 
based on only two given points of an output response. 
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Chapter 2 

Linear Feedback Control 



The by far most used control method in industry is the proportional-integral- 
derivative or PID controller. It is currently claimed that 90 to 95% of industrial 
problems can be solved by this type of controller, which is easily available as 
an electronic module. It allies an apparent simplicity of understanding and 
a generally satisfactory performance. It is based on a quasi-natural principle 
which consists of acting on the process according to the error between the set 
point and the measured output. Indeed, along the chapters of this hrst part, 
it will appear that numerous variants of PID exist and that improvements can 
often be brought either by better tuning or by a different configuration. 



2.1 Design of a Feedback Loop 

2.1.1 Block Diagram of the Feedback Loop 

Feedback control consists of a reinjection of the output in a loop (Fig. 2.2). The 
output response y or controlled variable is used to act on the control variable 
(or manipulated input) u in order to make the difference {yr — y) between the 
desired or reference set point yr and the output y as small as possible for any 
value of any disturbance d. The output y is linked to the set point yr by a 
system which forces the output to follow the set point. 

If a fixed value of the set point is imposed, the system is said to be regulating 
or in regulation mode; if the set point is variable (following a trajectory), the 
system is said to be tracking the set point or in tracking mode or subjected to 
a servo-mechanism. The trajectory tracking is often met, e.g. in the case of 
batch reactors in fine chemistry, a temperature or feed profile is imposed or, in 
the case of a gas chromatograph, a temperature profile is imposed on the oven 
temperature. 

Open-Loop System: 

The vocabulary open or closed loop, comes from the electricity domain. Thus, 
an electrical circuit must be imagined, open when it is an open loop, closed for 
a closed loop. 
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d{t) 



u{t) 



Process 



y{t) 



Figure 2.1: Process representation in open loop 

In the open loop (Fig. 2.1), the output value is not used to correct the error. 
The open loop can work (theoretically) only if the process model is perfect 
and in the absence of disturbances: in practice, many phenomena such as 
measurement errors, noise and disturbances are superposed so that the use of 
the open loop is to be proscribed. However, feedforward control (Sect. 6.6) 
is a special open-loop design to counterbalance the measured disturbances; in 
many cases, it is coupled with feedback control. 

Closed-Loop System: 



Disturbance 




Figure 2.2: Representation of a closed-loop process 

The process of Fig. 2.2 presents an output y, a disturbance d and a control 
variable u. In general, the shape of disturbance is unpredictable and the ob- 
jective is to maintain the output y as close as possible to the desired set point 
yr for any disturbance. A control possibility is to use a feedback realized by a 
closed loop (Fig. 2.2): 

— The output is measured using a given measurement device; the value indi- 
cated by the sensor is ym- 

— This value is compared to the set point giving the difference [set point 
— measurement] to produce the error e = yr — ym- 

— The value of this difference is provided to the main corrector, the function 
of which is to modify the value of the control variable u in order to reduce 
the error e. The corrector does not operate directly, but through an actuator 
(valve, transducer ...) to which it gives a value Ua- 

Important remark: one acts on the actuator and modifies the control variable u 
only after having noted the effect of the disturbance on the output. The set of 




Process Control 



67 



the comparator and the corrector constitutes the control system and is called 
a controller which can perform regulation actions as well as tracking. 



2.1.2 General Types of Controllers 

A controller can take very different forms. In reality, it represents a control 
strategy, that is to say, a set of rules providing a value of the control action when 
the output deviates from the set point. A controller can thus be constituted 
by an equation or an algorithm. 

In this first stage, only simple conventional controllers are considered. 



Proportional (P) Controller 

The operating output of the proportional controller is proportional to the error 



^(0 ^ab 



( 2 . 1 ) 



where Kc is the proportional gain of the controller. 

Uab is the bias signal of the actuator (= operating signal when e{t) = 0), 
adjusted so that the output coincides with the desired output at steady state. 

The proportional controller is characterized by the proportional gain Kc, 
sometimes by the proportional band PB defined by 



100 






Kc 



( 2 . 2 ) 



in the case of a dimensionless gain. In general, the proportional band takes 
values between 1 and 500; it represents the domain of error variation so that 
the operating signal covers all its domain. The higher the gain, the smaller the 
proportional band, and the more sensitive the controller. The sign of gain Kc 
can be positive or negative. Kc can be expressed with respect to the dimensions 
of output Ua{t) and input e{t) signals, or dimensionless according to the case. 

A controller can saturate when its output Ua{t) reaches a maximum Ua, max 
or minimum iXa,min value. 

The controller transfer function is simply equal to the controller gain 



^c(5) = Kc 



(2.3) 



In the following, it will be noticed that the proportional controller presents 
the drawback to creating a deviation of the output with respect to the set point. 



Proportional-Integral (PI) Controller 



The operating output of the PI controller is proportional to the weighted sum 
of the magnitude and of the integral of the error 



Ua{t) = Kc e(t) + 



-/■ 

K Jo 



e(x) dx + Uab 



(2.4) 
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For chemical processes, the integral time constant is often around 0.1 < r/ < 60 
min. 

The integral action tends to modify the controller output Ua{t) as long as an 
error exists in the process output; thus an integral controller can only modify 
small errors. The transfer function of the PI controller is equal to 

Gc{s) = Kc 

Philosophy: the integral action takes into account (integrates) the past. 

Compared to the proportional controller, the PI controller presents the ad- 
vantage of eliminating the deviation between the output and the set point 
owing to the integral action. However, this controller can produce oscillatory 
responses and diminishes the closed-loop system stability. Furthermore, the 
integral action can become undesirable when there is saturation: the controller 
acts at its maximum level and nevertheless the error persists; the phenomenon 
is called windup. In this case, the integral term increases largely, possibly with- 
out limitation, and it is necessary to stop the integral action. An anti-windup 
device must be incorporated into PI controllers (see Sect. 4.6.4). 



Ideal Proportional-Derivative (PD) Controller 

The operating output of the ideal PD controller is proportional to the weighted 
sum of the magnitude and the time rate of change of the error 

Ua{t) = Kc ^e(t) + + '^ab (2.6) 

The derivative action is intended to anticipate future errors. The transfer 
function of the ideal PD controller is equal to 

Gc{s) = Kc (I + td s) (2-7) 

This controller is theoretical because the numerator degree of the controller 
transfer function Gc{s) is larger than the denominator degree; consequently, it 
is physically unrealizable. In practice, the previous derivative action is replaced 
by the ratio of two first-order polynomials presenting close characteristics for 
low and medium frequencies. Furthermore, an integral action is always added. 
The derivative action has a stabilizing influence on the controlled process. 
Philosophy: the derivative action takes into account (anticipates) the future. 



Ideal Proportional-Integral-Derivative (PID) Controller 



This type of controller is the most often used, however, in a slightly different 
form from the ideal one which will be first presented. The operating output of 
the ideal PID controller is proportional to the weighted sum of the magnitude, 
the integral and the time rate of change of the error 



^a(0 — 



e{t) + td 



de{t) 

dt 




+ Uab 



( 2 . 8 ) 
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The transfer function of the PID controller is equal to 



Ge(s) = Kc 



1 + T£) s H 

TjS 



(2.9) 



The previous remark on the physical unrealizability of the derivative action is 
still valid. 

Philosophy: owing to the derivative action, the PID controller takes into ac- 
count (anticipates) the future, and owing to the integral action, the PID takes 
into account (integrates) the past. 

Remark 1: 

The previous theoretical controller is, in practice, replaced by a real PID con- 
troller of the following transfer function 



^c(^) = K, 



TjS + 1 

TjS 



TpS + 1 
(3tds + 1 



(2.10) 



which is physically realizable. 

Remark 2: 

It is often preferred to operate the PID controller by making the derivative ac- 
tion act no more on the error coming from the comparator but on the measured 
output, under the theoretical form 

Ua{t) = Kc (^e{t) + ^ J e(x) dx - td (2-H) 

or practically 

C/,(s) =p(^l + E{s) - y^is) (2.12) 

This method of taking into account the derivative action allows to avoid brutal 
changes of the controller output due to the error signal variation. 



2.1.3 Sensors 

This point may seem simple; indeed, in a real process, it is an essential element. 
Without good measurement, it is hopeless to control the process well. The 
sensor itself and the information transmission chain given by the sensor are 
concerned. The common sensors that are met on chemical processes are: 

— Temperature sensors: thermocouples, platinum resistance probes, pyrome- 
ters. Temperature sensors can be modelled from the response they give to 
a temperature step according to a hrst- or second-order model, sometimes 
with a time delay. 

— Pressure sensors: classical manometers using bellows. Bourdon tube, mem- 
brane or electronic ones using strain gauges (semiconductors whose resis- 
tance changes under strain). Diaphragm pressure sensors use detection of 
the diaphragm position by measurement of electrical capacitance. They are 
often represented by a second-order model. 
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— Flow rate sensors: for gas flow such as thermal mass flow meters (based 
on the thermal conductivity of gases, the gas flow inducing a temperature 
variation in a capillary tube), variable area flow meters (displacement of a 
float in a conical vertical tube); for liquid flow such as turbine flow meters 
(rotation of a turbine), depression flow meters as venturi- type flow meters 
(the flow rate is proportional to the square root of the pressure drop), vortex 
flow meters (measurement of the frequency of vortex shedding due to the 
presence of an unstreamlined obstacle), electromagnetic flow meters (for 
electrically conducting fluids), sonic flow meters, Coriolis effect flow meters. 
Flow rate sensors have very fast dynamics and are often modelled by an 
equation of the form 

flow rate = ay^ AP 

where the proportionality constant a is dependent on the sensor and AP 
is the pressure drop between the section restriction point and the outlet. 
These signals are often noisy because of flow fluctuations and should be 
filtered before being used by the controller. 

— Level sensors: floats (lighter than the fluid), displacement (measurement 
of the apparent weight of a half-submerged cylinder) through a pressure 
difference measurement, conductivity probes indicating liquid presence, ca- 
pacitance detectors for level variations. 

— Composition sensors: potentiometers (chemical potential measurement of 
an ion by means of a specific electrode), conductimeters (measurement of 
a solution conductivity), chromatographs (separation of liquids or gases), 
spectrometers (visible, UV, infra-red, ...). Among these, chromatographs 
pose a particular and very important problem in practice: the information 
provided by these apparatus arrives a long time after the sampling and thus 
there exists a large time delay that must be included in the model. This 
time delay can be the cause of a lack of mastering or imperfect mastering 
of the process control. 



In the absence of a measurement concerning a given variable, if a model of the 
process is available, it is possible to realize a state observer called a software 
sensor. This latter will use other available measurements to estimate the value 
of the unmeasured variable, it is comparable to an indirect measurement. The 
linear Kalman filter or nonlinear (extended Kalman filter) is often used for this 
purpose (see Sect. 11.1.2 and 18.4.3). Chemical composition estimations are 
particularly concerned by this type of sensor. 

The transmitter is the intermediary between the sensitive element of the 
sensor and the controller. It is a simple converter which is then considered as 
a simple gain, ratio of the difference of the output signal (often transmitted in 
the range 4-20 mA) over the difference of the input signal given by the sensor. 

The set sensor-transmitter can be considered as a global measurement de- 
vice. 
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2.1.4 Transmission Lines 

Traditionally, transmission lines were pneumatic. Nowadays, more and more 
often they are electrical lines. In general, their dynamic influence on the pro- 
cess is neglected except in the case of very fast phenomena, which is not very 
common in chemical engineering. 

2.1.5 Actuators 

Actuators (Considine, 1999) constitute material elements which allow action by 
means of the control loop on the process. For example, as a flow rate actuator, 
a very common element is the pneumatic valve, operating as indicated by its 
name with pressurized air. It could as well be mechanically operated by a dc 
or a stepping motor (Fig. 2.3). 




Figure 2.3: Scheme of a typical sliding stem valve 

A valve is designed to be in position, either completely open (fail open, or air- 
to-open) or completely closed (fail closed, or air-to-close) when the air pressure 
is not ensured, which can happen in the case of an incident in the process. 
Consider the case of a valve closed in the absence of pressure: when the air 
pressure increases on the diaphragm, the spring is compressed and the valve 
plug pulls out from its seat, thus increasing the passage section for the fluid, 
hence the flow rate. The inverse type of valve (open in the absence of pres- 
sure) exists where the pressure increase makes the valve plug go down (either 
because of the position of the air inlet with respect to the diaphragm or be- 
cause of the disposal of the seat and the valve plug) and thus the cross-section 
decreases. The choice of valves is generally made by taking into account safety 
rules. There also exists motorized valves: rotating valves (butterfly, ball). In 
general, the valve dynamics is fast. It must not be forgotten that the valve 
introduces a pressure drop in the pipe. With respect to control, a valve should 
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not be operated too close to its limits, either completely open or completely 
closed, where its behaviour will be neither reproducible nor easily controllable. 
Frequently, a valve has a highly nonlinear behaviour on all its operating range 
and it is necessary to linearize it piecewise and use the constructed table for 
the control law. 

For liquids, the flow rate Q depends on the square root of the pressure drop 
APy caused by the valve according to 



Q — Cy 



AP 

d 



(2.13) 



where d is the fluid density (with respect to water) and Cy is a flow rate 
coefficient such that the ratio {CyjD'^) is approximately constant for liquids 
for a given type of valve, Dy being the nominal valve diameter. Viscosity 
corrections are required for Cy in the case of viscous liquids. 

For gases or vapours, when the flow is subsonic, the volume gas flow rate is 

Q = 0.92 0.148 y") (2.14) 

Y ^ up 



where Q is given in m^.s“^ at 15 °C under 1 normal atm. Pup is the upstream 
pressure (in Pa), Tup the upstream temperature, Y the dimensionless expansion 
factor and dg the gas density (with respect to air) . (7/ is a dimensionless factor 
which depends on the type of fittings of the valve and ranges from 0.80 to 0.98. 
Y is equal to 



1.63 AP 



(2.15) 



When Y < 1.5, the flow is subsonic; when Y > 1.5, the flow is sonic, i.e. 
choked. 

When the flow is sonic, the volume gas flow rate is 



Q = 0.92CfCyPup 



\/ dg 



up 



(2.16) 



The ratio q of the real flow rate Q to the maximum flow rate Qm 



Q 



Q_ 

Qm 



(2.17) 



can depend on the aperture degree x of the valve in several ways (Fig. 2.4). 
Denoting the sensibility as cr = dq/dx^ the latter can be constant (linear be- 
haviour: case 1), or increase with x (case 2), or decrease with x (case 3), or 
increase then decrease (case 4) (Midoux, 1985). Often, three main types of 
valves are distinguished (Thomas, 1999): with linear characteristics, with but- 
terfly characteristics and with equal percentage characteristics. Denoting the 
position of the valve positioner by x, the equations for the valve constant are 
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respectively 



Linear: 

Butterfly: 

Equal percentage: 



a 

a 

a 






= Cys (1 - cos(|^x)) 

-nx—1 

— ^VS 



(2.18) 




Figure 2.4: Influence of the aperture degree of a valve on its flow rate 



2.2 Block Diagrams, Signal-Flow Graphs, 
Calculation Rules 

The study of feedback control for single-input single-output processes is per- 
formed in this chapter using Laplace transfer functions. It would be possible to 
do the same technical realizations and their theoretical study based on state- 
space modelling. On the other hand, the theoretical discussion and mathe- 
matical tools would be completely different. A sketch of the state-space study 
nevertheless will be presented. 

When specialized packages for solving control problems (e.g. MATLAB©) 
are used, it is very easy to hud the state-space model equivalent to a transfer 
function. Furthermore, it is possible to set blocks in series or in parallel, to do 
feedback loops, either with transfer functions or in state space, and then to re- 
alize a complete block diagram in view of a simulation. However, an important 
difference exists between both approaches. When the studied system becomes 
complicated, the numerical solving based on transfer functions gives worse and 
maybe erroneous results, compared to the complete state-space solving. The 
reason is in the far more direct approach of the phenomena in state space and 
their direct numerical solving. 

Given a block diagram in which each block represents a transfer function, 
the output of any block must be calculated with respect to the input of any 
other block. Beyond the blocks of transfer functions, the block diagram uses 
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summators, which do the algebraic addition of inlet signals, and signal dividers, 
which separate a signal into two or several signals of same intensity. Most of 
the common cases are represented in Figs. 2.5 to 2.11 and results are given in 
the transfer function case and in the state-space case. 



Calculation rules with Laplace transform 



Ui{s) 



+ 



U2{S) 



+ 



Ms) 



+ 



Y{s) 



Figure 2.5: Block scheme number 1 

For block scheme number 1 (Fig. 2.5), which contains two summators, the 
Laplace transform equation is 



Y{s) = U,{s)^U2{s)-Us{s) 



(2.19) 



U{s) 



G{s) 



Y{s) 



Figure 2.6: Block scheme number 2 

For block scheme number 2 (Fig. 2.6), which contains only one transfer func- 
tion, the Laplace transform equation is 

F(s) = GU{s) (2.20) 




Figure 2.7: Block scheme number 3 under two equivalent representations 

For block scheme number 3 (Fig. 2.7), which contains one transfer function 
and a summator, the Laplace transform equation is 



F(s) =G(C/i(s) -C/2(s)) 



( 2 . 21 ) 
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U{s) 



G{s) 



y2{s) 



Yiis) 




Figure 2.8: Block scheme number 4 under two equivalent representations 



For block scheme number 4 (Fig. 2.8), which contains one transfer function 
and a signal divider, the Laplace transform equations are 

Y,{s) = GU{s) ; Y2{s) = U{s) (2.22) 




Figure 2.9: Block scheme number 5 under two equivalent representations 

For block scheme number 5 (Fig. 2.9), which contains two transfer functions 
in series, the Laplace transform equation is 

r(5) = GiG2U{s) (2.23) 




Figure 2.10: Block scheme number 6 under two equivalent representations 



For block scheme number 6 (Fig. 2.10), which contains two transfer functions 
in parallel and a summator, the Laplace transform equation is 

r(s) = (Gi - G 2 ) U{s) (2.24) 



For block scheme number 7 (Fig. 2.11), which contains two transfer functions 
in a feedback loop containing a summator with the feedback of sign e (e = +1 
for a positive feedback, e = — 1 for a negative feedback), the Laplace transform 
equation is 



y(s) 



Gi 

1 + e G1 G 2 



U{s) 



(2.25) 
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Figure 2.11: Block scheme number 7 under two equivalent representations 



Calculation rules in state space 

In state space, signals are directly considered with respect to the time varia- 
ble and each system or block number i is represented by the set of matrices 
Bi^Ci^ Di). Recall that if a system can be represented by a strictly proper 
transfer function, the matrix Di is zero and this is the case of most physical 
systems. Equations are given in the case of single-input single-output systems. 
Block scheme number 1 



y{t) = Ui{t) + U2{t) - U3{t) 

Block scheme number 2 

( x(t) = Ax(t) B u(t) 
[ y{t) = C x{t) D u{t) 

Block scheme number 3 

J x{t) = Ax{t) A B 
[ y{t) = C x{t) A D 
Block scheme number 4 

x(t) = Ax(t) B u(t) 
< yi{t) = C x{t) -\- D u{t) 

^ V 2 {t) = U{t) 



(2.26) 



(2.27) 



(2.28) 



(2.29) 



Block scheme number 5, two systems in series: equations for each 
following 



Xi{t) = 


Ai xi{t) + Bi u{t) 


yiit) = 


Cl xi{t) + Di u{t) 


X 2 (t) = 


A2 X2(t) + B2 yi(t) 


y{t) = 


C2 X2(t) + D2 yi(t) 



Defining the global state vector, union of both state vectors: 



block are the 



(2.30) 



x{t) 



Xi{t) 

X2{t) 



(2.31) 




Process Control 



77 



one obtains for two systems in series 
x{t) = 



±l{t) 




_ X2it) 





Ai 0 

B 2 Cl A 2 



x{t) + 



Bi 

B 2 Di 



u{t) 



(2.32) 



y(t) = [ D 2 Cl C 2 ] x(t) + D 2 Di u(t) 

Block scheme number 6, two systems in parallel 



x(t) = 



Xi{t) 

X2{t) 



Ai 0 
0 A 2 



x(t) + 



Bi 

B 2 



u(t) 



(2.33) 



[ y{t) = [ Cl -C 2 ] x{t) + {Di - D 2 ) u{t) 



Block scheme number 7, feedback loop of sign e: 

the general case where D\ and D 2 are not zero is hrst treated. 

The basic equations are the following 



Xl{t) = 


Ai xi{t) + Bi ui{t) 


y{t) = 


Cl xi{t) + Di ui{t) 


X2(t) = 


A 2 X2(t) + B2y(t) 


y2{t) = 


C 2 X2(t) + D2y(t) 


Ui(t) = 


u{t) + ey2it) 



(2.34) 



Eliminating internal variables ui(t) and ^ 2(^5 obtains 
x{t) = 



Xi{t) 

X2{t) 



Ai + eBiD2Ci [I - eDiD2\~" 

B2C1 [I-eDiD 2 ]~^ 

Bi + e Bi D 2 Di [I — e Di D 2 

B2D1 [I-eDiD 2 ]~^ 

Ci[I -eDiD2]-^ 

eDiC2 [I-eDiD2]~^ 



+ 

y{i) = 



eBiC2 + B 1 D 2 D 1 C 2 [I — eDiD2] 
A 2 + eB2DiC2 [I — eDiD2\ 

1 

u{t) 



-1 1 



X 



x{t) + Di [I — e Di D[ 



1-1 



u(t) 



(2.35) 

When both transfer functions are strictly proper, and matrices Di and D 2 
zero, equations can be simplihed as 



Xl{t) 

X2{t) 



X(t) = 

[ yit) = [ Cl 0 ] x{t) 



Ai e B\ C 2 

B 2 C 1 A 2 



x{t) + 



Bi 

0 



u{t) 



(2.36) 



Mason Formula and Signal-Flow Graphs 

The Mason formula allows us to quickly calculate global transfer functions for 
a block scheme where each block represents a transfer function. A complicated 
scheme such as in Fig. 2.12 is considered. 
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Figure 2.12: Block scheme of a closed-loop process 



This block scheme has two external inputs yr and d, and the transfer func- 
tions set point-output Gyr and disturbance-output Gyd must be respectively 
calculated 

y(s) = GyrYr{s)+GydD{s) (2.37) 

Each block is directed from input towards output and is called unidirectional. 
A loop is a unidirectional path which starts and ends at a same point, and 
along which no point is met more than once. A loop transmittance is equal 
to the product of the transfer functions of the loop. When a loop includes 
summators, the concerned signs must be taken into account in the calculation 
of the loop transmittance. 

Rather than directly working on the block diagram such as is currently de- 
scribed, it is preferable to transform this scheme into a signal-flow graph, which 
contains the topological information of the set of linear equations included in 
the block diagram. The word signal- flow (or signal flow) means a flow of fluxes 
or signals. 

To operate, some characteristic definitions of signal-flow graphs must be 
added: 

• A signal-flow graph is made of nodes and connecting branches (a line with 
an arrow). 

• A node is attributed to each variable which occurs in the system. The node 
i represents the variable yi for example. 

• For a branch beginning in i and ending in j, the transmittance aij of the 
branch relates variables yi and yj. 

• A source is a node from where only branches go out. 

• A sink is a node where only branches come in. 

• A path is a group of connected branches having the same direction. 

• A direct path comes from a source and ends in a sink; furthermore, no node 
should be met more than once. 
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• A path transmittance is the product of the transmittances associated with 
the branches of this path. 

• A feedback loop Bi is a path coming from a node i and ending at the same 
node i. Along a loop, a given node cannot be met more than once. 

• A transmittance of a loop Bi is the product of the transmittances associated 
with the branches of this loop. 

• Loops Bi and Bj are non-touching when they have no node in common. 
First, let us present some simple cases that allow us to understand signal- flow 
graphs as well as the associated equations, which are all linear. 




Figure 2.13: Signal-flow graph: addition 



a\ 



yi 



V2 



Ct2 



Figure 2.14: Signal-flow graph: addition 



^ a2 

y^Q — ^ ^ O 

Figure 2.15: Signal- flow graph: multiplication 



ai 



yi """"/O y2 



— CL2 



Figure 2.16: Signal-flow graph: feedback 



Additions: 

Graph 2.13 corresponds to the linear equation 



y?> = «u/i + «2 y2 



(2.38) 
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V2 

Figure 2.17: Signal- flow graph: feedback 

and graph 2.14 corresponds to the linear equation 

V2 = («i + «2) yi (2.39) 

Multiplication: 

Graph 2.15 corresponds to the linear equation 

ys = CL 2 V 2 = CLl d 2 yi (2.40) 

The transmittance of path from 1 to 3 is aia 2 - 
Feedback: 

Graph 2.16 corresponds to the linear equation 

y 2 =aiyi-a 2 aiy 2 ^V 2 = yi (2.41) 

1 + ai tt2 

The transmittance of path from 1 to 2 is -r -^ — . 

^ l+ai tt2 

Graph 2.17 corresponds to the linear equation 

y 2 = aiyi - a 2 y 2 ^ y 2 = , yi (2.42) 

1 + «2 

Then, the A characteristic function of the block scheme or determinant of 
the signal-flow graph is defined as 
A = 1 (transmittances of the loops), 

+ (products of the transmittances of all non-touching loops 
considered two by two), 

“ YI (products of the transmittances of all non-touching loops 
considered three by three). 

Note that A is independent of the input and the output. 

• A direct path from an input Ui to an output yj is any connection of directed 
branches and of blocks between i and j such that no point is met more than 
once. The input Ui and the output yj are connected by k direct paths each 
having the transmittance Tijk- Let A^j/^ also be the determinant of each 
direct path calculated according to the previous formula, giving A by setting 
equal to 0 all transmittances of the loops which touch the k-th direct path 
from i to j (suppress all the nodes and the branches of this direct path). 
According to the Mason formula, the transfer function of the input Ui to 
the output yj is equal to 

^ _ Yk=l Yjk Yjk 




(2.43) 
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with 



Yj{s)=GijUi{s) ( 2 . 44 ) 

The signal- flow graph corresponding to the previous block diagram 2.12 is 
given in Fig. 2 . 18 . 

(T4 




Figure 2 . 18 : Signal- flow graph corresponding to block diagram 2.12 



It might have been possible on this graph to merge Eq and Y : variables have 
here been distinguished to make the sink Y clearly appear. 

In this signal-flow graph, one wishes to calculates transfer functions from 
T^( 5 ) to T( 5 ) and from D{s) to 1^(5), respectively Gry and Gdy In this 
graph, hve loops exist with respective transmittances: G2G^G^^ —GiG2Gs^ 
—G\G2 Gq^ —GiG/^ and G^G^. There exist no two by two non-touching loops. 
The determinant of this graph is thus equal to 

A = 1 — {G 2 GsG^ — G1G2GS — G1G2GQ — GiG/^ + G/^G^) . . 

= 1 + GiG2Ge + {G 2 GS + G^){Gi - G^) 



To hnd the transfer function Gry, two direct paths must be noticed, one is 
rei 6263646566^ with transmittance Tryi = G1G2G1, for which Aryi = 1, the 
other one is 66162636566^ with transmittance Try2 = GiG/\^ for which Ary2 = 1- 
The transfer function Gry is thus equal to 



^ _ GiG2^3 + ^1^4 

“ 1 + GiG 2 Ge + (^ 2^3 + ^ 4)(^1 - ^ 5 ) 



( 2 . 46 ) 



To hnd the transfer function Gdy^ there exist two direct paths: ^6763646566^ 
with transmittance Tdyi = G2G^G^ for which Adyi = 1, the other is de^e^e^e^y 
with transmittance Tdy2 = G^G^ for which Ady2 = 1 - The transfer function 
Gdy is thus equal to 

1 + GiG2G6 + (G2G3 + G4)(Gi -Gs) 



( 2 . 47 ) 
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Globally, one obtains 

y (s) = Gry y.(s) + Gdy D{s) (2.48) 

2.3 Dynamics of Feedback-Controlled Processes 



Disturbance 




Measurement 



Figure 2.19: Block scheme of the closed-loop process 

The block scheme of feedback control makes use of previously studied elements 
with respect to their general operating principle. In the block scheme of the 
process and control system (Fig. 2.19), independent external inputs are on 
one hand the set point yr{t) imposed by the user and on the other hand the 
disturbance d{t) not mastered by the user; these inputs influence the output 
y{t). Indeed, the process could be subjected to several disturbances. The 
process undergoes differently the action of the control variable u{t) and of the 
disturbance d(t), thus this corresponds to distinct transfer functions denoted 
respectively by Gd{s) and Gp{s), so that, as a Laplace transform, the output 
Y (s) is written as 

T(s) = Gp{s)U{s) + Gd{s)D{s) (2.49) 

A summator will be used for the block representation. Instead of representing 
the block diagram in time space, it is represented as a function of the Laplace 
variable s. 

Other transfer functions indicate the functions of different devices: 

• Measurement: 

l"m(5) = G^(s)T(s) (2.50) 

It must be noted that the measured variable ym{t) generally does not have 
the same dimension as the corresponding output y{t). For example, if the 
output is a temperature expressed in degrees Celsius, the variable measured by 




Process Control 



83 



a thermocouple is in mV, hence the steady-state gain of the transfer function 
has the dimension of mV/Celsius. Similarly, for any transfer function, the 
steady-state gain has unit dimensions. Moreover, the sensor may introduce 
dynamics given by the transfer function Gm{s). 

• Regulation: 

Ua{s) = Gc{s)E{s) (2.51) 

with E{s) being the error equal to 

E{s) = Km Yr{s) - Ym{s) = Km Yr{s) - GmY{s) (2.52) 

• Compensation of the measurement: 

If the set point tjr is expressed in the same units as the output y (pressure in 
bar, temperature in C or K, ...), it is necessary to introduce a block to com- 
pensate the measurement (Fig. 2.19) so that the measured output ym and the 
compensated set point yrm have the same dimension (e.g. mA or mV), which 
is in general different from the output one. The gain Km of the measure- 
ment compensation block is equal to the steady-state gain of the measurement 
transfer function Gm- 



It is also possible to express the set point yr (t) in the same units as the mea- 
sured output ym{t) and, in this case, it is not necessary anymore to compensate 
the measurement {Km = !)• 



In the case of measurement compensation, this pure gain Km 

bv 


is calculated 


Km = lim Gm{s) = Gm{0) 


(2.53) 


so that the compensated set point is equal to 




Yrm{Y) — Km Yj^{s) 


(2.54) 


• Actuator: 




U{s)=Ga{s)Ua{s) 


(2.55) 



From these equations, it is interesting to express the output Y (s) with 
respect to the set point 1 ^( 5 ) and the disturbance E(s). One obtains 



V(s) = G^(s) G,(s) Ge(s) E(s) + Gd{s)D{s) (2.56) 

or, by expressing E{s), 

Y{s) = Gp{s) Ga{s) Gc{s) [Km Yr{s) - Gm{s) Y (s)] + Gd{s)D{s) (2.57) 
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The first term represents the influence of a change of the set point Tr(5) and 
the second term the influence of a change of disturbance D{s). The closed-loop 
transfer function for a set point variation will be 



^ Gp{s)Ga{s)Gc{s)K^ 
set point 1 



(2.59) 



and similarly the closed-loop transfer function for a disturbance variation 



^disturbance ^ Gafs) k(s) G^(s) 

The denominators of both closed-loop transfer functions are identical. These 
closed-loop transfer functions depend not only on the process dynamics, but 
also on the actuator, measurement device and the controller’s own dynamics. 

The application of the Mason formula would give the previous expressions. 
The signal-flow graph is given by Fig. 2.20. 




Figure 2.20: Signal- flow graph corresponding to the block diagram of the 
feedback control (Fig. 2.19) 



In the present case, only one loop exists, and the graph determinant is equal 
to 

A = 1 — {—GcGaGpGm) (2.61) 

Between the set point and the output, only one direct path exists with trans- 
mittance: Ti = KjnGcGaGp and determinant Ai = 1, so that the transfer 
function from the set point to the output is equal to 



y(£)_ ^ G^GgGpK^ 

T^(5) 1 + GcGaGpGm 

Similarly between the disturbance and the output, only one direct path exists 
with transmittance: Ti = Gd and determinant Ai = 1, so that the transfer 
function from the disturbance to the output is equal to 



l^(^) _ Gd 

D{s) 1 + GcGaGpGm 



The calculation of the transfer functions can be resumed in this simple case 
in the following manner: 
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The closed-loop transfer function is equal to [ product of the transfer functions 
met on the path between an input and an output ] over [ 1 + the product of all 
transfer functions met in the loop ]. So, between D{s) and Y ( 5 ) only Gd is met, 
while between Tr(5) and ^( 5 ) we meet Gc, Ga^ Gp. In the loop, Gc, Ga^ 
Gp-) Gm are met. The product Gc Ga Gp (T^, which appears in the denominator 
of the closed-loop transfer functions, is often called open-loop transfer function, 
as it corresponds to the transfer function of the open loop obtained by opening 
the loop before the comparator as can be done for an electrical circuit (Fig. 
2.21). This open-loop transfer function acts as an important role in the study 
of the stability of the closed-loop system. 




Figure 2.21: How the “open loop” must be understood by opposition to the 
closed loop 

Two types of control problems will be studied in particular: 

Regulation: 

The set point is hxed (T^(5) = 0) and the process is subjected to disturbances. 
The control system reacts so as to maintain y(t) at the set point value and tries 
to reject the disturbances: it is also called disturbance rejection. 

Tracking: 

It is assumed (in order to simplify the study) that the disturbance is constant 
{D{s) = 0) while the set point is now variable; the problem is to maintain y(t) 
as close as possible to varying yr{t). 

2.3.1 Study of Different Actions 

To display the influence of different actions, only hrst- and second-order systems 
will be studied. Moreover, to simplify calculations, it will be assumed that the 
transfer functions of the actuator and of measurement are both equal to unity 

Ga = l , Gm = l . Km = l (2.64) 

resulting in the simplihed Fig. 2.22. For these hrst- and second-order systems 
with different types of controller, responses to steps of set point or disturbance 
are given in Figs. 2.29, 2.30, 2.31, 2.32, and commented on in the following 
sections. 

As an aside, to simplify, it will be assumed that the order of process transfer 
function Gp and the order of the transfer function Gd dealing with the distur- 
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Figure 2.22: Block diagram for the study of the action of the different con- 
trollers 

bance are equal (which is by no means compulsory), but that these transfer 
functions have different gains and time constants. 

2.3.2 Influence of Proportional Action 

As the controller is proportional, its transfer function is 

Gc = K, (2.65) 



First-Order Systems 



A first-order system is described by a differential equation such as 

+ yW = "“W (2-66) 

where y(t) is a deviation variable such that ^(0) = 0 and {dy/dt)o = 0. 

The process transfer function linking the output Laplace transform Y (s) to the 
input Laplace transform U (s) is equal to 

G,(.) = ^ (2.67) 

The transfer function for the disturbance is also assumed to be first-order 

G4s) = ^ ( 2 . 68 ) 



The output Y (s) for any set point Tr(^) and any disturbance D(s) is thus equal 
to 



y(s) 



KpK, 

Tp s -\- 1 -\- Kp Kq 



Yr{s) + 



Kd TpS^l ^ 

Td s \ Tp S \ Kp Kq 



(2.69) 



If we set 5 ^ 0 (equivalent to an infinite time) in the transfer functions, the 
closed-loop transfer functions are respectively equal to the closed-loop steady- 
state gains (use of the final value theorem) for the set point yr and the distur- 
bance d 



K 



KpK, 

1 + Kp Kc 



(2.70) 
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K', = 



Kd 



1 + Kj) Kc 



(2.71) 



The closed-loop gain is modified compared to the open-loop gain Kp] 
tends towards 1 when the controller gain is large. The closed-loop gain relative 
to the disturbance is lower than the open-loop gain and tends towards 
0 when the controller gain is large. The closed-loop response is still first-order 
with respect to set point and disturbance variations. 

The open-loop time constant is r^; in closed loop, concerning set point 
variations, it is equal to 



^P = 



1 + iCr, Kc 



(2.72) 



thus it has decreased; the response will be faster in closed loop than in open 
loop. 

Consider the response to a step variation of set point (tracking) or distur- 
bance (regulation): 

Tracking study: 

The set point change is a step of amplitude A 



Yr{s) 



A 

s 



(2.73) 




Figure 2.23: Response of a first-order system {Kp = 5, = 10) to a set point 

unit step (proportional controller with increasing gain: Kc = 1, 2, 5) 



The disturbance is assumed constant or zero (D{s) = 0). The closed-loop 
response (Fig. 2.23) to a set point step is then equal to 



y(s) 



KpKc A 
Tp 5 T 1 T Kp Kc s 



K -4 

T^S + 1 S 



(2.74) 
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To get the time response y{t)^ T( 5 ) is decomposed into a sum of rational frac- 
tions, the first corresponding to the forced response Yf{s)^ the second to the 
natural response Yn{s) 



K K Yr, 

Y^s) =A^- = Yf{s) + y„(«) 



(2.75) 



hence 



y{t) = AK'p (l - exp(-t/Tp)) = yf{t) + yn{t) 
with: yf{t) = AK'p ; yn{t) = -AK'pexp{-t/Tp) 



(2.76) 



Figure 2.23 was obtained for a unit step of a set point. The asymptotic value 
of the output presents an offset with the set point; if the controller gain Kc is 
increased, this offset decreases (Fig. 2.23). In practice, other transfer functions 
must be taken into account: actuator, measurement, so that this set may not 
behave exactly as a first-order process and present e.g. a time delay, nonlinear- 
ities or neglected dynamics. The choice of too large gains for the proportional 
controller may render the closed-loop behaviour oscillatory or unstable. A high 
gain decreases the response time, imposing a more important demand to the 
actuator: the control variable u(t) varies more strongly and more quickly so 
that it can reach its limits, it is then saturated. 

Regulation study: 

Recall that the transfer function for the disturbance was taken to be first-order 
as that of the process, but with different gain ^ Kp and time constant 

Consider a disturbance step variation of amplitude A 



D{s) = 



(2.77) 



The set point is assumed constant (regulation). The closed-loop response (Fig. 
2.31) to a step disturbance is then equal to 



T(s) = 



Kd 



TpS + 1 



TdS + 1 Tp5 + 1 + KpKc S 
To get the time response y{t)^ Y (s) is decomposed into 

I ^ Kd Td {Tp - Td) 1 



r(5) = A 



+ 



1 

- + 



KpKc s (1 + Kp Kc) - Tp TdS + 1 



KdKpKcT^ 



1 



= + 
5 



{jp Ti (1 T Kp Kq)) (1 T Kp Kq) TpS T 1 T Kp Kq 

C2 , C3 



+ 



TdS + 1 TpS + 1 + Kp Kc 
hence the closed-loop response 



(2.78) 



(2.79) 



y{t) = A [ci + C 2 exp(-t/r^) + C 3 exp(-t (1 + Kp Kc)/Tp)] 



(2.80) 
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In the absence of a controller, the process is stable and the output tends towards 
AKd. When the proportional controller is introduced, the process remains 
stable; for a unit step disturbance, the output tends towards a new limit AK'^ 
and deviates with respect to the set point from this value AK'^. 

When the gain of the proportional controller increases, the deviation output-set 
point decreases and the influence of the disturbance decreases too. 

Consider now a disturbance impulse variation of amplitude A. The closed- 
loop response to this disturbance is then equal to 



Y(s) =A 



K, 



r^s + 1 



TdS -\- 1 TpS + 1 + KpKc 
Application of the hnal value theorem gives 



lim y(t) = lim[5T(5)] = 0 

t^oo s^O 



(2.81) 



(2.82) 



thus impulse disturbances are rejected by a simple proportional controller. 



Second-Order Systems 

Again, the actuator and measurement gains and transfer functions are taken 
to be equal to 1. 

In the case of a second-order system, the process transfer function is 



Gp{s) 



Kp 

+ 2 CpTpS +1 



(2.83) 



Assume that the transfer function for the disturbance is also second-order 



Gd{s) — 



Kd 



(2.84) 



+2 CdTdS+1 

The output T(s) for any set point V(s) and any disturbance D{s) is equal to 



T(s) 



y fsj 

s'^ + 2 (pTpS + 1 + KpKc 

Ad + 2 CpTpS + l 

tJ _|_ 2 Cd Td S -I- 1 T^s'^ + 2 CpTpS + l+ KpKc 



(2.85) 



Tracking study: 

In Eq. (2.85), only the term of the set point variation is concerned. For a set 
point step variation of amplitude A, Tr(5) becomes A/s, and T(s) is decom- 
posed into a sum of two fractions, the natural response of order 2 corresponding 
to the hrst factor of the previous expression and the forced response in 1/s. 
The closed-loop transfer function remains second-order as in open loop. The 
period and the damping factor are modihed 



^ 

^ ^l + KpKc ^1+KpKc 



( 2 . 86 ) 
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The steady-state gain becomes 



K 



KpKc 

1 + Kp Kc 



(2.87) 



Like for the first-order system, a deviation between the set point and the 
asymptotic response exists (Fig. 2.30), which is all the more important as the 
gain is low. 

Regulation study: 

As the influence of disturbance is studied, the second term of Eq. (2.85) is 
taken into account. The closed-loop response remains second-order as in open 
loop. The proportional controller is not sufficient to reject the disturbance: 
a deviation between the set point and the asymptotic value still exists (Fig. 
2.32). 

A proportional controller does not change the order of the process; the 
steady- state gain is modified, decreased in two cases (a/ if > 1, or b/ if 
Kc > 1/(1 — Kp) when Kp < 1): the time constants also decrease. 



2.3.3 Influence of Integral Action 

The study is similar to that realized in the case of the proportional controller 
and will be consequently less detailed. 

The transfer function of a PI controller is equal to: 

Ge(s) =Kc(^l + (2.88) 



First-Order Process and Influence of Pure Integral Action 

Though integral action is never used alone, in this section, in order to charac- 
terize its influence, we first assume that the controller is pure integral and has 
the following transfer function 



Ge(s) = — 

T/ S 



(2.89) 



In the case of a first-order process, the response Y (s) to a set point or distur- 
bance variation is equal to 



Kr, 



Kr 



Y{s) = 



I +TpS Tj S 



1 -f 



Kp ^ 

1 -I- T„ S Tl S 



-Yr{s) + 



1 -f 



Kd 

1 + 

Kp ^ 

1 -I- T„ S Tl S 



■D{s) 



(2.90) 



y(s) = 



TpTi 

KpK, 



+ 



Tl 



KpK, 



s -I- 1 



•V(s) + 



Kd 



{TpS + l)TlS 



TdS + l TjTp s'^ + Tj S + Kp K, 



■D{s) 



(2.91) 




Process Control 



91 



The integral controller has modihed the system order: the transfer function of 
the closed-loop system is now of order 2, i.e. larger by one unity than the order 
of the open-loop system. The natural period of the closed-loop system is equal 
to 



TpTi 

KpK, 



(2.92) 



and the damping factor 



c' = - / 

2\lTpKpKc 



(2.93) 



As the response of a hrst-order process in open loop becomes second-order in 
closed loop, its dynamics is completely different. According to the value of the 
damping factor the response will be overdamped, or underdamped, possibly 
explosive. If the controller gain is increased, keeping constant the integral 
time constant, the natural period and the damping factor decrease, thus the 
response will be less sluggish, but the displacement will move progressively 
from overdamped responses towards oscillatory responses. 

It is interesting to study the tracking, i.e. the response to a set point 
variation. The set point undergoes a step variation of amplitude A 



Yr{s) 



A 

s 



hence 



T(s) = 



TpTi 

KpK, 






Tl 



KpK, 



■5 + 1 



To find the asymptotic behaviour, the final value theorem gives 



(2.94) 

(2.95) 



lim y(t) = lim[5T(5)] = A (2.96) 

t^oo s^O 

thus the limit of y{t) is equal to A, the set point value. We thus find the 
important result that the integral action eliminates the asymptotic deviation. 
The value of the set point is reached faster when the gain is high, but at the 
expense of oscillatory responses. According to the type of controlled variable, 
it is preferable to rather choose an overdamped response (not going beyond the 
set point, e.g. for a chemical reactor which could undergo runaway above some 
safety temperature) or oscillatory (rapidly reach a state close to the set point). 

Let us study in a similar way the regulation, thus the infiuence of a distur- 
bance. Consider a disturbance step change of amplitude A that could not be 
rejected by the proportional controller 

D{s) = 4 (2.97) 



y(s) 



Kd {Tp s + 1 )tis a 

TdS + 1 TjTp +Tl S + KpKc S 



hence 



(2.98) 
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The final value theorem gives 

lim y(t) = lim[5T(5)] = 0 (2.99) 

t^oo s^O 

thus step-like disturbances which were not rejected by a proportional controller 
are perfectly rejected owing to the integral action. 

Of course, impulse disturbances are also rejected by the PI controller. 



First-Order Process with PI Controller 

The transfer function of the PI controller is equal to 

Gc{s) = Kc ^1 + ( 2 . 100 ) 

hence the general closed-loop response to set point and disturbance variations 






Kd 



1 +7-^5 



or 



1+ (l + — 

I +TpS \ Tj S 



1 + 






Y{s) = 



Tj S+1 



'TpTi 2 I + 1 -I 

s + T/ — — s + 1 



1 + T„S 



V(s) + 



K, 1 



KpK, 



KpKc 

Tl 



Kd 



KpKc 



S (1 + Tp S) 



TdS + 1 ^ KpKc + 1 ^ ^ ^ 



D{s) 



KpKc 



KpKc 



1 

Tj S 



D{s) 



( 2 . 101 ) 



( 2 . 102 ) 



Compared to the proportional action alone, the order of each transfer function 
output /set point or output /disturbance increases by one unit. 

Previously drawn conclusions for the integral action alone remain globally 
true: 

• In tracking, during a set point step variation (Fig. 2.29), the output 
tends towards the set point even for low controller gain. 

• In regulation, impulse disturbances are of course rejected, but also step 
disturbances (Fig. 2.31). 

To be convinced, it suffices to use the final value theorem. 



Second-Order Process 

Similarly, in the case of a second-order process in open loop, the closed-loop 
output with a pure integral controller would be of the immediately next order, 
i.e. a third-order. 

The PI controller used in the case of the tracking corresponding to Fig. 2.24 
leads to oscillations all the more important for the second-order system in that 




Process Control 



93 




Figure 2.24: Response of a second-order system {Kp = 5, = 10, = 0.5) to 

a set point unit step with influence of the integral time constant (PI controller 
Kc = 2,r/ = 10 or 20 or 100). When r/ increases, the oscillation amplitude 
decreases 



the integral time constant is lower, and thus that the integral gain increases. 
The deviation with respect to the set point is cancelled by the integral action 
either for a set point variation (Fig. 2.30), or a disturbance variation (Fig. 
2.32). With the overshoot increasing with the integral gain, it will be frequently 
necessary not to choose too large an integral gain. In those hgures, the gain 
and the integral time constant have not been optimized, as the objective was 
only to display the influence of the integral action. 

2.3.4 Influence of Derivative Action 

The transfer function of a pure ideal derivative controller is equal to 

Gc{s) =KcTds (2.103) 

This transfer function is improper and the ideal PID controller (Fig. 2.25) of 
transfer function 

Gc{s) = Kc ^1 + + T]J (2.104) 

is not realizable, as the numerator degree is larger than the denominator degree 
because of the ideal derivative action term. If this controller were used as such, 
it would amplify high-frequency noise because its amplitude ratio is unlimited 
at high frequency (see frequency analysis. Chap. 5). 

The following study simply aims to demonstrate the characteristics of pure 
derivative action. 
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Figure 2.25: Block diagram of the ideal PID controller 



First-Order Process and Pure Derivative Action 



In the case of a first-order process, if one only looks at the influence of derivative 
action with the controller given by Eq. (2.103), the response Y (s) to a set point 
or disturbance variation is equal to 



y(s) = 



Kp 

TpS +1 



KcTd s 



1 + 



Kp 

TpS + 1 



KcTd s 



-V(s) + 



1 + 



Kd 

TdS + 1 
Kr, 



T„S + 1 



KcTd s 



-D{s) 



Kp Kc td s 



(Tp + KpKcTD)s + 1 



Y (s) H — TpS + 1 

TdS + l {Tp+KpKcTD)s + l ^ > 

(2.105) 

Transfer functions are first-order as in open loop, thus the derivative action 
has no influence on the system order. On the other hand, the derivative action 
introduces a lead term in the numerator. The closed-loop time constant is equal 
to 

Tp = Tp + Kp Kc Td (2.106) 



and thus is increased with respect to the open loop; the closed-loop response 
will be slower than the open-loop one, and this effect increases with the deri- 
vative controller gain. This will help to stabilize the process if the latter shows 
tendencies to oscillations in the absence of the derivative action. 



First-Order Process with Real PID Controller 

Compared to the PI controller previously studied, a physically realizable deri- 
vative action is introduced by the following real PID controller (Fig. 2.26) of 
transfer function 



^c(^) = Kc 



TjS + 1 

TjS 



TpS + 1 
(3 TpS + 1 



(2.107) 



which is physically realizable. This transfer function can be seen as the filtering 
of an ideal PID controller by a first-order filter. In the case of a pneumatic PID 
controller, (3 is included between 0.1 and 0.2. For the electronic PID controller, 
one sets 0 < /3 <C 1. 
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Figure 2.26: Block diagram of the real PID controller given by Eq. (2.107) 



A real PID controller (Fig. 2.27) can also respond to the following slightly 
different equation, which is frequently used 



^c(^) = K, 



1 + — + 
TjS 



TdS 



IR. 

N 



5 + 1 



(2.108) 




Figure 2.27: Block diagram of the real PID controller given by Eq. (2.108) 



In the case of studied first-order processes, the derivative action in the PID 
controller does not seem to add an important effect with respect to integral 
action alone, as the studied process already presents a closed-loop overdamped 
behaviour with the PI controller. If for other parameter values the closed- 
loop behaviour had been underdamped, the addition of derivative action would 
have allowed considerable decrease of oscillations which would have become 
acceptable as in the following case of the second-order process (Fig. 2.28). 
The infiuence of the derivative action is clearer in response with respect to a 
disturbance step variation (Fig. 2.31) than in response with respect to a set 
point step variation (Fig. 2.29). It is shown that the overshoot is decreased. 
The derivative action thus brings a stabilizing infiuence with respect to integral 
action. 
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Second-Order Process 



In the case of a second-order process and pure integral action, the response 
Y (s) to a set point variation is equal to 



y(s) 



Gp Kc Tp s 
1 + Gp Kc td s 



V(s) 



Kp Kc Tp s 

t^s^ + 2(ts + 1+ Kp Kc T£) s 



Yr{s) (2.109) 



The closed-loop response is second-order as it was in open loop. The derivative 
controller does not modify the order of the response 



T(s) 



Kp Kc Tp s 

(2 Ct + Kp Kc Td)s 1 



V(s) 



( 2 . 110 ) 



In this case, the time constant t remains the same while the damping factor 
of the closed-loop response is modihed with respect to the open-loop damping 
factor and becomes 



c = 



‘I C^p T -\- Kp Kq T]j 

27 



( 2 . 111 ) 



The response is thus more damped in closed loop than in open loop, and this 
damping increases with the gain Kc of the derivative controller and with the 
derivative time constant. 

Globally, the same effect is noticed with a real PID controller of transfer 
function given by Eq. (2.107). 




Figure 2 . 28 : Response of a second-order system {Kp = b^Tp = 10, (p = 0.5) to 
a set point unit step (real PID controller with influence of the derivative time 
constant td- Kc = 2, t/ = 20, td = 0.1 oil or 10, (3 = 0.1). When td increases, 
oscillations decrease 

Compared to the integral action which cancels asymptotic deviation but leads 
to strong oscillations, the addition of real derivative action strongly decreases 
oscillations which become acceptable, all the more so as the derivative time 
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constant td is higher (Fig. 2.28). However, it must be noted that the increase 
of derivative action tends to increase measurement noise and that this effect is 
not wished, so that a too large value of td must be avoided in practice. The 
derivative action brings a stabilizing effect with respect to integral action. The 
overshoot is also decreased. These two effects are clear in the study of the 
inffuence of either a set point step variation (Fig. 2.30) or a disturbance step 
variation (Fig. 2.32). 

2.3.5 Summary of Controllers Characteristics 

A proportional (P) controller contains only one tuning parameter: the con- 
troller gain. The asymptotic output presents a deviation from the set point, 
which can be decreased by increasing the controller gain. The use of too large 
gains can make the process unstable due to neglected dynamics or time delays. 

A proportional-integral (PI) controller presents the advantage of integral 
action leading to the elimination of the deviation between the asymptotic state 
and the set point. The response is faster when the gain increases, and can be- 
come oscillatory. For large values of the gain, the behaviour may even become 
unstable. The decreasing of the integral time constant increases the integral 
gain and makes the response faster. Because of the integral term, the PI con- 
troller may present a windup effect if the control variable u becomes saturated. 
In this case, the integral term becomes preponderant and needs time to be 
compensated. It is preferable to use an anti- windup system (Sect. 4.6.4). 

The proportional-integral-derivative (PID) controller presents the same in- 
terest as the PI with respect to the asymptotic state. Furthermore, the deri- 
vative action allows a faster response without needing to choose too high gains 
as for a PI controller. This derivative action thus has a stabilizing effect. 

The ideal PID controller is indeed replaced by a real PID controller, the 
transfer function of which given by Eq. (2.107) or (2.108) is physically realiz- 
able. In the case of a pneumatic PID controller, (3 is included between 0.1 and 
0.2. For the PID electronic controller, one sets 0 < /3 <C 1. 
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Figure 2.29: Comparison of the influence of the controller type on the res- 
ponse of a first-order system {Kp =b^Tp = 10) to a set point unit step. (Pro- 
portional: Kc = 2 (top). Proportional-integral: Kc = 2,r/ = 20 (middle). Real 
proportional-integral-derivative: Kc = 2,r/ = 20, td = 1,(3 = 0.1 (bottom)) 




re 2.30: Comparison or the mhuence or the controller type on the response 
second-order system {Kp = 5,Tp = 10, Cp = 0.5) to a set point unit 
(Proportional: Kc = 2 (top). Proportional-integral: Kc = 2, r/ = 20 



IleC Real DroDortional-intesfral-derivative: Kr. = 2.rr = 20. rn = 1,3 = 
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Figure 2.31: Comparison of the influence of the controller type on the response 
of a first-order system {Kp = 5, = 10 : = 2, = 2) to a disturbance unit 

step. (Proportional: Kc = 2 (top). Proportional-integral: Kc = 2, r/ = 20 
(middle). Real proportional-integral-derivative: Kc = 2,r/ = 20, td = l,/3 = 
0.1 (bottom)) 




Figure 2.32: Comparison of the influence of the controller type on the respons* 
of a second-order system {Kp = b^Tp = 10, (p = 0.5 : = 2, = 2, Q = 0.25 

to a disturbance unit step. (Proportional: Kc = 2 (top). Proportional-integral 
Kc = 2,r/ = 20 (middle). Real proportional-integral-derivative: Kc = 2,r/ = 



20. Tn = 1. d = 0.1 fbottomP 
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Chapter 3 

Stability Analysis 



The stability analysis relies on the same mathematical concepts, whether the 
system is in open or closed loop. Naturally, functions, variables, matrices, etc., 
which will be the object of the study, will be different. The cases of linear and 
nonlinear systems will be studied separately. 

A system is stable if a bounded output corresponds to a bounded input in 
the limits of the problem physics (a curve of undamped sinusoidal allure is not 
bounded): it is often called BIBO. (Bounded input, bounded output). 

However, the previous definition of stability is not totally satisfactory for a 
closed-loop system: indeed, not only the external stability should be ensured, 
but also the internal stability, i.e. all the internal signals inside the closed 
loop must be bounded (Vidyasagar, 1985). Note that when the used controller 
is stable, then external stability induces internal stability. 



3.1 Case of a System Defined by its Transfer 
Function 



In Sect. 1.9.5, the response of a linear system, defined by its transfer function 



G{s) 



N{s) 

D{s) 



degree of < degree of D 



(3.1) 



to an input step has been studied. This allows us to demonstrate the conditions 
of stability: 

• a system is stable when the poles of its transfer function (roots of the de- 
nominator D{s)) are all negative real or complex with negative real part. 
These poles will be said to be stable. 

• if a complex pole has its real part equal to zero, the system is called 
marginally stable. 

• a system is unstable when one or several poles of its transfer function are 
positive real or complex with a positive real part. This type of poles is said 
to be unstable. 
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A system is minimum-phase when all zeros of its transfer function (roots of 
numerator N{s)) are all negative real or complex with a negative real part. 
When there are one or several positive real or complex with positive real part 
zeros, the system is called nonminimum-phase. 

3.2 State-Space Analysis 

3.2.1 General Analysis for a Continuous Nonlinear 
System 

Consider a physical system, the time evolution of which is described by a non- 
linear ordinary differential equation of order n as 

^ = = ( 3 . 2 ) 

Introducing new variables Xi = (1 < ^ the previous differential 

equation can be written as a set of first-order ordinary differential equations 

X2 

(3.3) 

Xn 

/(Xl, 

which is the description of this nonlinear system in state space; variables Xi are 
the system states. 

Frequently, a nonlinear system will be indeed represented by a system of 
ordinary differential equations of the form 

xi = fi{xi 

Xn — fn{x^ 

which we will denote by 

x = f{x,t) , x{0) = xo (3.5) 

Solutions (^t(^Co) of fho previous differential system are called integral curves 
of the vector field / and depend on the initial states Xq. These curves can be 
parameterized with respect to t and are also called trajectories; they represent 
the locus of the successive states of the system. They verify (^t(^^^oTo) = 

When the time variable t does not intervene explicitly, the system is called 
autonomous. Any autonomous nonlinear continuous system can be represented 
by the following set of ordinary differential equations 





X = f{x) 



a;(0) = xo 



(3.6) 
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A point is called stationary (or singular) (or equilibrium) if 

fix*) = 0 (3.7) 

If the stationary point is not placed at the origin, it can be brought to the 
origin by making the translation x' = x — x* . Thus, discussions and theorems 
consider the origin as the stationary point around which the stability is studied. 

It is possible to dehne rigorously from a mathematical point of view several 
kinds of stability: 

Lyapunov Stability : 

The stationary point at the system origin (3.5) is stable if for all e > 0, there 
exists a scalar ce(e,to) such that ||a:^o|| < implies lim ||(^t(^^^oT)|| < for fol 
t > to. 

Lyapunov stability means that all trajectories remain in a neighbourhood 
of the origin. The considered norm is the Euclidean norm^. 

Asymptotic stability: 

The stationary point at the system origin (3.5) is quasi-asymptotically stable 
if there exists a scalar (3 such that ||^Co|| < P induces limt^oo = 0- 

The asymptotic stability furthermore implies that the origin must also be 
stable, as all trajectories tend towards the origin. 

Instability means that trajectories move away from origin. 



First-Order Stability 

The continuous-time nonlinear system is linearized around the stationary point 
a:^*, thus its linear approximation 



5x = A5x with: Sx = x — x"" 



(3.8) 



governs the time evolution when the system is perturbed from Sx around the 
stationary point x"^ . A is the Jacobian matrix or stability matrix 



dfi{x) 


dfi{x) 


dxi 


dXn 


dfnix) 


dfnix) 


dxi 


dXn 



If the eigenvalues^ Xi of A are distinct, if the associated eigenvectors are 
noted rji^ the trajectory can be described (Parker and Chua, 1989) at hrst-order 
by 

(ptix* + Sx) = X* +ci exp{\it)rii + . . . + c„ exp(A„t)7y„ (3.10) 

where c* are constant coefficients satisfying the initial condition. A stationary 
point such that no eigenvalue has a real part equal to zero is called a hyperbolic 

^The Euclidean norm of a vector is the square root of the sum of the squares of its 
elements, i.e. the “length” of a vector. 

2 The eigenvalues of a matrix A are the roots of its characteristic polynomial of degree 

n, equal to the determinant of the matrix A — XI. 
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point. Only hyperbolic points are considered in this framework (Guckenheimer 
and Holmes, 1986). Two cases are possible: 

• If the eigenvalue Xi is real, it represents a contraction velocity if < 0 and 
an expansion one if > 0, in the neighbourhood of the stationary point 

in direction 77 ^, 

• If the eigenvalue Xi is complex, its conjugate complex A^ is also an eigenvalue 
as A is a real matrix; the conjugate vector fji is the eigenvector associated 
with the eigenvalue A^. The contribution 

Ci exp{Xit)f]i + Ciexp{Xit)f]i (3.11) 

describes a spiral in the plane generated by lZe{r]i) and Xm{rji). The 
real part of eigenvalue Xi represents a contraction velocity of the spiral 
if lZe(Xi) < 0 and an expansion one if lZe(Xi) > 0. 

• If all eigenvalues of A have their real part strictly negative, the stationary 
point is asymptotically stable. 

• When all eigenvalues have their real part positive, the stationary point is 
unstable. 

• When some eigenvalues have their real part negative and others their real 
part positive, the stationary point is a saddle point. 

• When all eigenvalues are real, the stationary point is a node. 

• When all eigenvalues are in the same complex half-plane (either left or right), 
and some of them are complex, the stationary point is a focus. 

In dimension 2 , when the eigenvalues are equal and when the matrix A is 
diagonalizable, the stationary point is a focus ; if matrix A is not diagonalizable, 
the stationary point is a node (d’ Andrea Novel and de Lara, 1994). 

In the case of a stable node, this stationary point is an attractor: if a 
trajectory originates from a point situated in the neighbourhood of this point, 
it converges directly towards the attractor without turning around it. 

In the case of a stable focus, this stationary point is an attractor: if a 
trajectory originates from a point situated in the neighbourhood of this point, 
it converges towards the attractor, turning around it. 

In the case of an unstable node and of an unstable focus, trajectories move 
away from the stationary point. 

Stability properties are more easily visualized in a plane (n = 2 ) which 
easily allows a graphical representation. This representation made in the phase 
plane or phase space is called a phase portrait and the variables Xi are the 
phase space coordinates. When n > 2, the followed space directions interfere 
and strongly complicate the discussion (Gilmore, 1981). 

Asymptotic Stability Domain 

The asymptotic stability domain (Pellegrini et ah, 1988) of a point is the 
set of regular points Xo such that 

lim x{t^ Xo) = x"" 



(3.12) 
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The frontier of an asymptotic stability domain is formed by the set of tra- 
jectories. If the asymptotic stability domain is limited, its frontier is formed 
either by a limit cycle, or by a phase polygon, or by a closed curve of critical 
points. The frontier can be determined by the following method (Pellegrini 
et ah, 1988): 

• The stationary points of the system x = f{x) are determined by solving 

f(x) = 0. 

• Their stability analysis is realized by calculating the eigenvalues of the li- 
nearized system. 

• An arbitrarily small stability neighbourhood is determined around each sta- 
tionary point. 

• The system 

X = -f(x) (3.13) 

is integrated, starting from the points in the neighbourhood of the stationary 
points. 

• The integration of both concerned systems (3.6) and (3.13) is realized in the 
neighbourhood of the other critical points. 

• Lastly, the asymptotic stability domain is determined by topological con- 
siderations. 

It must be noted that the integration of system (3.13) is equivalent to a back- 
ward integration (with respect to time) of system (3.6). The trajectories of 
system (3.13) are the same as those of system (3.6), but directed in the oppo- 
site way. Moreover, the asymptotically stable stationary points of Eq. (3.6) 
become unstable for Eq. (3.13). 

A limit cycle is stable when close trajectories come near it asymptotically; 
it is unstable when they move away. 

Lyapunov Direct Method 

The Lyapunov direct method or Lyapunov second method is an extension of 
the idea of the mechanical Lagrange energy to study the behaviour of the sys- 
tem with respect to stability without needing to solve its equations (Storey, 
1979). In chemical engineering and, in particular, for chemical reactors consi- 
dered in open or closed loop, some important references should be cited (Aris 
and Amundson, 1958; Berger and Perlmutter, 1964; Himmelblau and Bischoff, 
1968). Eor linear systems, the construction of a Lyapunov function is easy to 
realize, seldom for nonlinear systems, but its theoretical interest is important. 
Eor a nonlinear system, it is sometimes possible to realize the study in the 
neighbourhood of an equilibrium point by hrst making a linearization and then 
studying the linear approximation. 

The Lyapunov function (Eig. 3.1) V{x) is a function such that the surfaces 
dehned hj V = constant are closed, concentric and decrease towards the origin 
(or the stationary point) when ||a:^|| ^0. Moreover, the trajectories cross these 
surfaces towards the inside when the origin is a stable point. 

Stability and asymptotic stability theorems: 
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^ h4 



Figure 3.1: Typical shape of a Lyapunov function 



• If V{x) is definite^ in the neighbourhood of the origin, 

• If V{0) = 0, 

• If V{x) is semidefinite (resp. definite) and of opposite sign to V{x), 

then the origin is in a stable state (resp. asymptotically stable) of equilibrium 
of the system. 

The function V{x) responding to these conditions is a Lyapunov function 
of the system. Notice that the derivative of V {x, t) is equal to 



V{x,t) 



E 



dV dx, dV 
dxi dt ^ dt 



(3.14) 



Asymptotic stability theorem in all space: 

the origin is asymptotically stable in all space (for all trajectories coming from 
all the phase space points) or completely stable if there exists a function V (x) 
such that: 

• V (x) is positive definite Wx. 

• Its derivative V{x) is negative definite Wx. 

• V (x) oo when ||a:^|| ^ oo. 

As has already been mentioned, the construction of Lyapunov functions is often 
delicate, sometimes impossible (Himmelblau and Bischoff, 1968; Storey, 1979; 
Warden et ah, 1964). The linear case is well adapted to this construction. 
Consider the linear system 

x = Ax (3.15) 

and the quadratic form 

V{x)=x^Bx (3.16) 

^A function /(a?) is positive definite in a domain T> around the origin if / and its partial 
derivatives df/dxi exist and are continuous in V; if furthermore /(O) = 0, V{x) > 0 for 
a? / 0. It is semipositive definite if V{x) > 0 for a? / 0. 
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its derivative is equal to 



V{x) = Bx + x^Bx = x"^{A^B + BA)x (3.17) 



Theorem: 

The origin is asymptotically stable (the eigenvalues of A have their real part 
negative) if and only if there exists a matrix B symmetrical positive dehnite, 
solution of the Lyapunov equation 

A^B + BA + C =Q (3.18) 

for all C a positive dehnite symmetrical matrix. 

By using this theorem, the derivative of V{x) is equal to 

V{x) = -x^Cx (3.19) 

thus is negative definite. It results that the function V (x) = x^Bx satisfying 
these conditions is a Lyapunov function. 



3.2.2 Case of a Linear Continuous System 

Example 3.1: Stability Study of an Harmonic Oscillator 

A remarkable example of stability study allowing visualization is the harmonic 
oscillator modelled by the second-order ordinary differential equation 

9 d^x ^ dx , , 

The stationary point corresponds to x = 1. Using this condition, set = x — 1 
and X 2 = xi. The equation can be written in the phase space as 



Xi = X2 

X2 = -{2(tx 2 ^ Xi)/r‘^ 



(3.21) 



which corresponds to a stable system. Three cases have been studied: 

• Underdamped second-order (r = 1, (“ = 0.5). 

• Critically damped second-order (r = 1, (“ = !). 

• Overdamped second-order (r = 1, (“ = 2). 
with the same initial conditions {x\ = 2, X 2 = 1). 

For each case, the states have been represented versus time (left of Fig. 3.2) 
and the phase portrait corresponding to the curves of state X 2 with respect to x\ 
(right of Fig. 3.2). In the left-hand figures, it appears that the states converge 
towards 0 when time becomes large. In the phase portrait, the trajectory tends 
towards the origin; if the time parameter were represented on the trajectories, 
the time arrow would be directed towards the origin, i.e. the convergence 
point. For the underdamped second-order system, the origin is a stable focus. 
For critically damped and overdamped second-order systems, the origin is a 
stable node. 
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Figure 3.2: Case of a second-order stable linear system. Left-hand figures: 
representation of the states versus time. Right-hand figures: phase portrait. 
(Top: r = 1, ( = 0.5, middle: r = 1, C = bottom: r = 1, C = 2) 



It is possible to study a second-order unstable linear system represented by 
the following ordinary differential equation 

9 d^x ^ dx , , 

This system has no stable stationary point. By performing the same variable 
change as previously, the system becomes in the state space 



Xi = —X2 

X2 = -{2(tX2 + 



(3.23) 



The origin is a stationary point. This system has been studied for four different 
initial conditions (cases (a), (b), (c), (d) of Fig. 3.3) corresponding to: 
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Figure 3.3: Case of a second-order unstable linear system. Left-hand figures: 
representation of the states versus time. Right-hand figures: phase portrait 



(a) xi(0) = 2, X 2 ( 0 ) = 1; (b) xi(0) = -2, X 2 ( 0 ) = 1; (c) xi(0) = 2, X 2 ( 0 ) = 
-1; (d) xi(0) = -2, X2(0) = -l. 

It is not necessary for the time to become important for the states to increase 
rapidly (in absolute value). In the phase portrait, the points on the trajectories 
move away from the initial point, with a tendency to come near to the origin 
and then move away without tending towards any limit. It even appears that 
asymptotes depend on the initial point. The origin in fact here is a saddle 
point. 



3.2.3 Case of a Nonlinear Continuous System: 

The Polymerization Reactor 

Polymerization reactors are well known for their behaviour, which presents se- 
veral stationary points because of the high exothermicity of the polymerization 
reaction. 

Example 3.2: Stability Study of a Polymerization Reactor 

The studied example drawn from Hidalgo and Brosilow (1990) concerns styrene 
polymerization (Fig. 3.4). The continuous reactor is fed by three independent 
streams of monomer, initiator and solvent. It is cooled by a heating-cooling 
fiuid circulating in a jacket. 
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Figure 3.4: Continuous polymerization reactor 



The dynamic simulation model reads 



Xi 

X2 

< 

Xs 

X4 



{FjCia - FqXi) 

V 

{Fjyi CjYia Fq X2) 

Fq {Tg - Xs) _ 

V 

Fj(Fj^ifi X4) 



- kdXi 
— kpX2F 

AH 



p Cp 



. ^ UA ^ 

kpX2iZ - (X 3 - X 4 ) 






+ 



UA 

Pj Cpj Vj 



pCpV 

(X 3 - X4) 



(3.24) 



where the states are, respectively: 

X\: initiator concentration, 

X 2 '- monomer concentration, 

X 3 : temperature, 

X 4 : jacket temperature. 

The signification of the parameters and variables and their values are given 
in Table 3.1. 

The total outlet volume fiow rate Fq is equal to the sum of the inlet fiow 
rates Fq = Fi F F^ + Fg. The initiator, the monomer and the solvent are 
assumed to enter at the same temperature T^. 

The chain concentration of growing polymer is equal to 



n={2fkdXilktF (3-25) 



The dissociation, propagation and termination rate constants follow the Arrhe- 
nius law: ki = kio exp(— ^^/RT), {i = d, p or t). 
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Table 3.1: Nominal variables and main parameters of the continuous poly- 
merization reactor 



Feed solvent flow rate 


Fs = 0.1275 1/s 


Feed monomer flow rate 


Fm =0.105 N 


Feed initiator flow rate 


Fi = 0.03 1/s 


Feed monomer concentration 


Cma = 8.698 mol/1 


Feed initiator concentration 


Cia = 0.5888 mol/1 


Feed temperature 


Ta = 330 K 


Reactor volume 


V = 3000 1 


Jacket volume 


Vj = 3312.4 1 


Inlet jacket temperature 


Tj,in = 295 K 


Density of the reacting mixture x heat capacity 


pCp = 360 


Heat-conducting fluid flow rate 


Fj = 0.131 1/s 


Density of the heat-conducting fluid x heat capacity 


Pj Cpj = 966.3 


Global heat transfer coefficient x heat exchange area 




between the jacket and the reactor contents 


UA = 70 


Initiator efficiency 


/ = 0.6 


Heat of reaction 


AiJ = -16700 


Preexponential factor for dissociation 


kdo = 5.95 X 1013 


Activation energy for dissociation 


Ed/R = 14897 K 


Preexponential factor for propagation 


kpQ = 1.06 X 10^ 


Activation energy for propagation 


Ep/R = 3557 K 


Preexponential factor for termination 


kto = 1.25 X 109 


Activation energy for termination 


Et/R = 843 K 



The conversion rate corresponding to the reaction advancement is equal to 

X = {Fm Cma ~ Fo X^) / {Fm Cma) (3.26) 

To demonstrate the multiple stationary states, a series of simulations was 
performed with the same initial states x\ = 0.0486 mol/1 and X 4 = 316.2 K, but 
with a variable monomer concentration X 2 as well as the reactor temperature 
X 3 , which varied from 305 K to 350 K to obtain Fig. 3.5. Fixed initial values 
were chosen in accordance with the stationary values indicated by Hidalgo and 
Brosilow (1990). The curves drawn on the hgure are the trajectories followed 
by an operating point with respect to time. One notices that at temperatures 
lower than or equal to 345 K, trajectories end at the stable stationary point 
at low conversion y = 0.045 and low temperature xs = = 323.6 K. On the 

other hand, at temperatures higher than or equal to 346 K, trajectories end at 
the stable stationary point at high conversion y = 0.800 and high temperature 
xs = Tr = 406.0 K. The unstable stationary point thus corresponds to an 
intermediate temperature around 345 or 346 K. Trajectories tend to come closer 
to the unstable point that can be graphically approximately situated at = 
344.6 K, y = 0.24. 
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Figure 3.5: Phase portrait: conversion versus temperature for a styrene po- 
lymerization reactor 



To get the stationary points, it suffices to transform the previous set of 
ordinary differential equations by cancelling the time derivatives. One thus 
obtains a nonlinear system of algebraic equations which can be solved by the 
Newton-Raphson method. The obtained solution depends on the initial point, 
which was chosen to be equal to xi = 0.05 mol/1, X 2 = 2.0 mol/1 and X 4 = 300 
K, except for the reactor temperature X 3 , which varied from 320 K to 400 K. 
Thus, the two stable stationary points were found: 

xi = 0.06683 mol/ 1 , X 2 = 3.325 mol/ 1 , xs = 323.6 K, X 4 = 305.2 K, x = 
0.04446; 

xi = 0.0008258 mol/1, X 2 = 0.6927 mol/1, xs = 406.2 K, X 4 = 334.6 K, 
X = 0.8009; 

as well as the unstable stationary point: 

xi = 0.06148 mol/ 1 , X 2 = 2.703 mol/ 1 , X 3 = 343.1 K, X 4 = 312.1 K, x = 0.2232. 
The two stable points had been previously determined with the help of trajec- 
tories: the first solution corresponds to the low conversion point and the second 
solution to the high conversion point. Moreover, this method allows us to place 
with accuracy the unstable stationary point. 

To determine the frontier separating the domains of asymptotic stability, 
the technique described in Sect. 3.2.1 was used. Stationary points have just 
been determined. Starting from the neighbourhood of the unstable saddle 
point, the integration of the system of the form (3.13), when the polymeriza- 
tion reactor model is symbolized by the form (3.6), was realized. The frontier 
separating the two domains of asymptotic stability was thus obtained and was 
thus materialized by the two curves added to Fig. 3.6 compared to Fig. 3.5. 

The eigenvalues associated with the linearized system around the stationary 
point, obtained from (3.24), have been calculated for the three stationary points 
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Figure 3.6: Phase portrait: conversion versus temperature for a styrene po- 
lymerization reactor. On this hgure, to be compared with the previous hgure, 
the frontier between the two domains of asymptotic stability is drawn 



and have given the following results: 

• Low conversion point: 

Ai = -0.366 X 10-^ ; A 2 = -0.88 x 10“^ 

A 3 = -0.1024 X 10-3 + i 0.63 x lO-^ ; A 4 = -0.1024 x 10“3 - i 0.63 x lO-^ 
thus this point is a stable focus. 

• High conversion point: 

Ai = -0.66 X 10-2 ; A 2 = -0.44 x 10-^ 

A 3 = -0.20 X 10-2-^0.12 X 10-2; A 4 = -0.20 x 10-2 + ^0.12 x IO -2 
thus this point is a stable focus. 

• Unstable intermediate point: 

Ai = 0.99 X 10-^; A 2 = -0.945 

A 3 = -0.72 X 10-^ - i 0.19 X 10-^ ; A 4 = -0.72 x 10-^ + i 0.19 x 10-^ 
thus this point is a saddle point. 

A classical explanation of the existence of multiple stationary points for the 
chemical reactor in which an exothermic reaction occurs (case of polymerization 
reactions) is the following (Van Heerden, 1953). A high temperature induces a 
high reaction rate. The shape of the curve of heat Qg generated by the chemical 
reaction with respect to reactor temperature is sigmoidal (Fig. 3.7). In the case 
of the polymerization reaction studied in this section, the heat Qg generated 
by the reaction (counted positively as received by the system) is equal to 

Qg = V{-AHkp[M]V) (3.27) 

The reactor is cooled by a cooling fluid circulating in a jacket at mean tem- 
perature Tj, and the transferred heat (given to the cooling fluid) through the 
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Figure 3.7: Diagram of heat power versus reactor temperature 



wall separating the jacket from the reactor is 

Qtr = UA{Tr-Tj) (3.28) 

where U is the global heat transfer coefficient through the exchange surface of 
area A. As the reactor is fed by a stream at temperature Ta lower than the 
reactor temperature T^, the cooling capacity of the reactor (counted positively 
if Tr > Tj) is equal to 

Qr = u A (Tr — Tj) -\- P Cp Fq {Tr — Ta) (3.29) 

with monomer concentration [M] = X 2 , reactor temperature Tr = X 3 and mean 
temperature of the cooling fluid in the jacket Tj = X 4 , which gives a straight line 
in the diagram (T^, Q). If the slope of the cooling line is insufficient, the cooling 
line and the curve of the generated heat present three crossing points which 
have been previously commented on. The limit corresponds to the tangent at 
the inflection point of the sigmoid. The slope of the cooling line is a linear 
function of the inlet temperature Tee of fho cooling fluid; it also depends on 
the flow rate Fc of the cooling fluid and on the global heat transfer coefficient 
UA. 

To obtain point by point the curves of Fig. 3.7, we assumed that the reactor 
temperature Tr is perfectly controlled, thus Tr is considered no more as a state 
variable, and that the continuous reactor has reached its steady state. Then it 
was sufficient to make temperature Tr vary in an adequate domain. 

In chemical engineering practice, it is frequently desirable to control the 
reactor at the unstable stationary point. 
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3.2.4 State-Space Analysis of a Linear System 

Consider a general linear system defined in state space by 

{ X = Ax ^ B u 
= C X A D u 



(3.30) 



and having riu inputs n states Xi and Uy outputs yi. This system thus 
includes n differential equations, the stability of which must be studied. Indeed, 
if the states are bounded, the output, which is a linear combination of the states 
and the inputs, will be bounded. 

On the other hand, it is sufficient to study the stability with respect to only 
one input, as for this linear system, the obtained properties will be able to be 
extended to all other inputs. Thus, the following system will be studied 



{ Xi — ail Xi + . . . + aifi Xfi -\- bii ui 

Xfi = ^nl a^i + . . . + ajin Xfi + bnl U\ 



(3.31) 



The input u can be assumed to be zero. Indeed, a constant input corresponding 
to an input step of amplitude k produces a response which can be decomposed 
into two parts: the forced response linked to the amplitude of the input and 
the natural response linked to the system itself. As the problem is to charac- 
terize the system, the natural response is sufficient. Under these conditions, 
exponential solutions of the form Xi{t) = fi exp(5t) of this system are searched 
for. The system can then be written 

s /i exp{st) = an /i exp{st) + . . . + ai„ /„ exp{st) 

< : (3.32) 

^ s fn exp{st) = a„i /i exp{st) + . . . + a„„ /„ exp{st) 

or, after arrangement, 

[A-sI]f = 0 (3.33) 

where / is the vector of coefficients fi and I is the identity matrix. So that 
the solution of this system is different from the trivial solution (all Si zeros), 
the determinant of {A — s I) must be zero 

det([A — 5 /]) = 0 (3.34) 

This equation is, in fact, the characteristic polynomial of matrix A and the roots 
Si are the eigenvalues of A. The state-space stability condition is deduced: 

so that a linear system, defined in state space, is stable, it is necessary and 

sufficient that all eigenvalues of matrix A are negative real or with a negative 
real part. 

It might have been possible to start from system (3.31), perform Laplace 
transformation on this system, and solve. The results would have been identi- 
cal. 
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3.3 Stability Analysis of Feedback Systems 

Previously, it was shown that systems exhibit a different behaviour when they 
are in open loop or closed loop. The closed-loop response to a set point or a 
disturbance variation is inffuenced by the presence of actuators, measurement 
devices and, of course, controllers. To design a feedback control system, it is 
necessary to proceed to a stability study. 

In open loop, the output Y (s) is simply related to the control variable U (s) 
and to the disturbance B(s) by the relation 

T(s) = U(s) + D(s) (3.35) 

If the system is subjected to a known input, e.g. a step, the asymptotic be- 
haviour will be found by using the final value theorem 



lim y(t) = lim 5 T( 5 ) 

t^oo s^O 



(3.36) 



One already knows that when the process transfer function possesses positive 
real poles, or complex poles with a positive real part, the process is naturally 
unstable. The same reasoning can be made in closed loop (Fig. 2.19), the 
output Y (s) being related to the set point Tr(^) and to the disturbance by the 
relation 

y(s) = G;(s)ras) + g'As)d(s) (3.37) 

with the closed-loop transfer function relative to the set point 



Ge(s) Gg(s) Gp(s)K^ 

1 + ^c(^) ^a(^) Gp(^) ^m(^) 



(3.38) 



and the closed-loop transfer function relative to the disturbance 



^d(^) 



1 + ^c(^) ^a(^) Gp(^) ^m(^) 



(3.39) 



Both closed-loop transfer functions G^^(s) and G'^{s) can be written as the 
ratio of two polynomials 



deg(AT') < deg(D') 
G'dis) = ^ 7 ^ deg(AT") < deg{D") 



(3.40) 



Conditions relying on the respective degrees of numerator and denominator 
express the fact that the system must be physically realizable (condition of 
physical realizability). 

The closed-loop stability analysis will consist of the study of the positions 
in the complex plane of the poles of the closed-loop transfer functions, hence 
the zeros of denominators D'{s) and D''{s). 
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In general, the open-loop transfer function Gd relative to the disturbance 
has no common poles with Gc, Ga^ Gp and Gm because these different transfer 
functions are obtained by identification or calculation in a real process. 

If the transfer function Gd has common poles with Gc^ Ga^ Gp or a 
particular study must be realized to take it into account. If the transfer function 
Gd has unstable poles which are not cancelled by those of Gc, Ga^ Gp or Gm^ 
the closed-loop transfer function relative to the disturbance has the same 
poles and the feedback system is thus unstable for any disturbance. 

Now study the case of the closed-loop transfer function G'^{s) relative to 
the set point. In this case, as the poles of [Gc Ga Gp] are common with those of 
[1 + Gc Ga Gp Gm] (which has only in addition the poles of Gm)^ it is equivalent 
to study the zeros of the characteristic equation denoted by 

^ ^ G c{s) G a{s) G p{s) G m{s) = 0 
l^Goi{s)=0 

where Goi{s) is the transfer function of the open loop, i.e. the product of all 
the transfer functions found in the loop of Fig. 2.19 (open must be considered 
as open between the measurement device and the comparator). When the 
characteristic equation has positive real zeros, or complex zeros with a positive 
real part (values in the right half 5-plane) the closed-loop system is unstable. 

Let us study the case of the closed-loop transfer function G'^{s) relative to 
the disturbance. We notice that its poles are those of the open-loop transfer 
function Gd^ to which must be added the zeros of the characteristic equation. 
Thus it will be sufficient to complete the analysis of the characteristic equation 
with that of the transfer function Gd- In the case where the transfer function 
relative to the disturbance Gd has a positive pole or a pole with a positive 
real part, and that this pole is not common to Gc-, Ga^ Gm or Gp^ this pole 
cannot be cancelled in closed loop and the system is unstable with respect to 
the disturbance, whatever the controller choice. 

3.3.1 Routh-Hurwitz Criterion 

The search of the zeros of the characteristic equation is far from evident without 
calculator or computer. Routh-Hurwitz criterion allows us to know analytically 
if at least one zero is situated in the right half 5-plane which corresponds to an 
instability. 

The characteristic equation is expressed in the form of a rational fraction 
where only the numerator will be considered 

1 + Gc{s) Ga{s) Gp{s) Gm{s) ~ tS \ (3.42) 

with: 

N {s) = + ai5^ ^ + . . . + Gfi—is + Qfi (3.43) 

where ao > 0 (else, the whole polynomial is multiplied by —I). 

Property: 
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The Routh-Hurwitz criterion is a necessary and sufficient condition for the 
stability analysis: the number of roots of N{s) that are positive real or complex 
with a positive real part is equal to the number of changes in sign of the first 
column of the associated array. 

A polynomial with real coefficients is called Hurwitz if all its roots have a 
negative real part. 

First search: if at least one coefficient is negative, there exists at least 
a positive real zero or complex with a positive real part, and the polynomial 
N{s) is called unstable. 

Second search: if all ai are positive, the following array is formed by n + 1 
rows: 



row 




ao 


CL 2 


a 4 


row 


a^n— 155 


ai 


as 


as 


row 


a^n — 255 


h 


64 


h 


row 


a^n — 355 


C3 


C5 


C7 


row 


“gi” 


jn—l 






row 


“gO” 


kn 







where the first two rows are formed by the coefficients of the studied polynomial. 
The following rows of the array are related by the following relations: 
first the row of “5^-255. 

7 G1CI2 — ^0^3 h CI1CI4 — noU 5 7 GiCIq — ( 20^7 

^2 — ad ~ ad ~ ad 

then the row of “5^-255. 

_ b 2 a 3 — _ ^ 2 a 5 ~ cgbe _ ^ 2 a 7 ~ cgbs 

62 &2 &2 

then the row of 

d4 = ^ 3 & 4 -& 2 C 5 = C3&6-&2C7 ... 

The array so formed is triangular, except for the last two rows with only one 
element per row. 

Consider the elements of the first column of the array: ao, ai, 62, C3, . . ., 
kn- If one of these elements is negative, at least one zero is positive real, or 
complex with a positive real part: the polynomial N{s) is unstable. 



Example 3.3: Stability Analysis Using Routh-Hurwitz Criterion 

According to Fig. 3 . 8 , let ns consider a process with the following transfer 
function 



(s + 4)(s + 6) 



(3.44) 



measurement Gm = 8 and actuator 






1 

F+F 



( 3 . 45 ) 



Let US suppose that a simple proportional controller is used {Gc = Kc) as is 
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Figure 3.8: Block diagram for Example 3.3 



often done for PID tuning (Sects. 4.5.1, 5.5). The characteristic equation is 



1 + GcGaGrpGjn — 0 



! + ■ 



\Kr 



= 0 



(5 + 1)[S + 4)(5 + 6) 
or: (5 + 1)(5 + 4) (5 + 6) + 8Kc = 0 5 ^ + 11 5 ^ + 34s + 24 + 8Kc = 0 

(3.46) 

All coefficients of this polynomial are positive, thus the second search phase 
must be performed. 

Set the Routh-Hurwitz table: 

34 

11 24 + 8iCc 



11 X 34 - 1 X (24 + 8iCc) 

n 



row 24 + 8Kc 

Assuming Kc to be positive, the only element of the hrst column that is a 
potential problem is the coefficient of line “ 5 ^” which must be positive. One 
obtains 

11 X 34- 1 X (24 + 8iCc) > 0 ^ iCc < 43.75 (3.47) 



When the gain of the proportional controller is larger than the limit value 43.75, 
some roots of the characteristic equation become positive or have a positive real 
part, and the closed-loop system becomes unstable. 



3.3.2 Root Locus Analysis 

In the examples of the previously studied controller, it was shown that the 
controller gain influences the closed-loop response. The root locus (or Evans 
locus) is the set of the points of the complex plane described by the roots of the 
characteristic Eq. (3.41) when the controller gain varies between 0 and inhnity. 
The values of the gain for which the roots are situated in the right half 5-plane 
correspond to an unstable closed-loop system. 

Nowadays, packages allow to easily hnd the roots of a polynomial, thus the 
zeros of the characteristic equation and thus allow us to easily obtain exact 
root loci. However, some indications are given in order to approximately draw 
these root loci: 
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• As the characteristic polynomial (3.43) has real coefficients, its roots are 
either real or a conjugate complex, thus the root locus is symmetrical with 
respect to the horizontal real axis, 

• One begins by placing the poles of the open-loop transfer function GqI (Eq. 
(3.41)) and they are denoted differently (by a circle in Fig. 3.9) from the 
future closed-loop poles (which will be denoted by a cross in Fig. 3.9), 

• The asymptotic lines of the root locus are determined. These asymptotes are 
obtained by making s tend towards infinity. The number N of asymptotes 
is equal to the relative degree of the open-loop transfer function GqI^ i.e. 
the difference between the degrees of the denominator and the numerator 
of Goi^ or still the number of poles rip of GqI minus the number of zeros riz. 

The trigonometric angles of the asymptotic lines are equal to 

ziz f 2 /t I ]_^7r 

Angles of the asymptotes = — /c = 0, 1,...,A^ — 1 (3.48) 

Note that an asymptote having an angle equal to tt is the half real negative 
axis, i.e. a line starting from a given real value and tending towards — oo. 

Let us denote the zeros of GqI by Zi and its poles by GqI can be 
rearranged as 



Goi 





sUp-Uz ^ ^ - . . . - + . . . 



(3.49) 



where Kc is the controller gain. As the characteristic equation corresponds 
to Goi{s) = — 1, and we are interested in the behaviour when s tends towards 
infinity, considering the two higher-degree terms gives an approximation of 
the equation giving a unique value of s as 



/ ^ {pi + ---+Pn„)-{zi+... + Z^j y-^ ^ ^ 

V np-Uz ) 



(3.50) 



The position of the point of the real axis which is the intersection of the 
asymptotes is thus deduced 



(pi + . . . + Pn ) — (zi Zn^ ) 

5 = 

rip — riz ^ 2 ) 

sum of poles — sum of zeros ^ ‘ ^ 

number of poles — number of zeros 

• The number of branches is equal to the degree of the characteristic equation, 
i.e. the number of poles for a proper transfer function. 

The branches of the root locus start (when Kc 0) at the poles of GqI 
and end (when Kc ^ oo) at the zeros of GqI- If the number of closed-loop 
poles is equal to the number rip of open-loop poles, the number of branches 
ending at finite zeros is equal to the number riz of open-loop zeros. The 
other rip — riz branches tend towards the asymptotes at infinity (the zeros 
are called zeros at infinity). The point where the branches break away 
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corresponds to multiple roots of the characteristic equation and is a solution 
of the following equation 




(3.52) 



where Kc has been expressed as a function of s deduced from the charac- 
teristic Eq. (3.42). 

• It is possible to determine other characteristics in order to obtain more 
precisely the branches of the root locus (Ogata, 1997). 

Example 3.4: Root Locus 



Consider again Example 3.3. According to Eig. 3.8, the process has the follow- 
ing transfer function 



(s + 4)(s + 6) 



(3.53) 



measurement: Gm = 8 and actuator 






1 



(3.54) 



The open-loop transfer function is thus equal to 



Goi = GcGaGMm = Gc-, T7 77 (3.55) 

^ (s-f l)(s-f 4)(s-f 6) ^ ^ 

Its poles are negative so that the system is stable in open loop. We wish to 
study the influence of the controller gain and we assume that the controller is 
a simple proportional one: Gc = Kc. The characteristic equation is thus 



1 + GqI — 0 (5 + 1)('5 T 4)('^ + 6 ) + ^Kc — 0 (3.56) 



Its solutions depend on the values of the proportional controller gain. 

According to the previous rules, we notice that GqI presents three poles and 
no zero. It follows that the root locus (Eig. 3.9) will have three asymptotes at 
7t/3, 7T and — tt/ 3 which intersect at the real axis point of abscissa: —(I + 4 + 
6)/3 -3.66. 

The branches start at the poles of GqU he. at the points on the real axis of 
abscissae —I, —4 and —6. 

The two branches having a complex part start at —I and —4 and break 
away at the point that we hnd from Eq. (3.56) by setting 



dKc 

ds 



= 0 



with: Kc = — 



{s + l){s + A){s + 6) 



(3.57) 



which gives the two roots —5.12 and —2.21. Only the root —2.21 conveys 
which corresponds to the value Kc = 1.026, at which we begin to obtain com- 
plex closed-loop poles (or roots of the characteristic equation). The other root 
corresponds to a negative value of Kc. 

The third branch on the real axis starts from —6 and is directed towards 



— oo. 
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Figure 3.9: Root locus diagram of a third-order open-loop transfer function. 
The poles of the open-loop transfer function and the closed-loop poles are 
respectively represented by “o” and by “x”. The asymptotes are represented by 
continuous lines 



In Fig. 3.9, the root locus (symbol “x”) was obtained for gain values included 
between 0.1 and 1.5 by increments of 0.1, then between 5 and 60 by increments 
of 5. Moreover, the figure displays the open-loop poles (here, points —1, —4 and 
—6 of real axis). For gain values smaller than 1.026, the three closed-loop roots 
are real and two of them become conjugate complex beyond this value. More- 
over, when the gain becomes larger than around 40, these conjugate complex 
roots have their real part positive: in closed loop, the system becomes unstable 
for high values of the controller gain although it is stable in open loop. 



Example 3.5: Root Locus 



According to Fig. 3.10, let us assume that a process has the following transfer 
function 



Gp{s) 



4(25 + 1) 

(55 + 1)(2552 + 55 + 1 ) 



(3.58) 



and that Gm = 1 and = 1. 

The open-loop transfer function is thus equal to 



^ ^ 
^ ol — 



4(2s + l) 



(5s + 1)(25s2+5s + 1) 



(3.59) 



Besides Gc, one pole is negative real (s = —0.2) and the two other poles have 
a negative real part (s = —0, 1 ± 0, 1732i), thus the system is open-loop stable. 
Moreover, Goi has a negative real zero (s = —0.5). To obtain the root locus, we 
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Vr 



+ 



a 







4(2s + l) 


y 


-► Ltc 




(5s + 1)(25s2 + 5s + 1) 





Figure 3.10: Block diagram for Example 3.5 



assume that the controller is simply proportional: Gc = The characteristic 
equation takes the form 



l + Goi =0 (5s + 1)(25s2+5s + 1) + 4Kc(2s + 1) =0 (3.60) 

Goi presents three poles and a negative zero. The relative degree of the open- 
loop transfer function GqI is equal to the difference between the degree of the 
denominator and that of the numerator, i.e. 2. The root locus (Fig. 3.11) will 
have two asymptotes at it/ 2 and — tt/2, which intersect at the point on the real 
axis of abscissa: —((—0.2 — 2 x 0.1732) — (— 0.5))/(3 — 1) ^ 0.05. 




Figure 3.11: Root locus for open-loop transfer function (3.59) of relative de- 
gree 2. The poles of the open-loop transfer function and of the closed-loop 
transfer function are respectively represented by ”o” and by ”x”. The asymp- 
totes are represented by continuous lines 

The root locus has three branches starting at the poles of GqI’- one of the 
branches corresponding to the real roots starts from the pole of GqI on the real 
axis at abscissa —0.2 and ends at the zero of GqI at abscissa —0.5; the two other 
branches correspond to the conjugate complex roots and start at the points of 
the complex plane of affixes —0.1 ± 0.1732h 
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Figure 3.11 gives the root locus (symbol “x”) obtained for gain values suc- 
cessively comprised between 0.1 and 0.5 in increments of 0.1, then between 1 
and 5 in increments of 1, then between 10 and 50 in increments of 5. More- 
over, the hgure shows the open-loop poles (here, point —0.2 of the horizontal 
real axis and points —0.1 ± 0.1732i in the complex plane). It appears clearly 
in the hgure that the root loci originate from the open-loop poles. When the 
gain is larger than about 3.77, two roots have their real part positive and the 
closed-loop system becomes unstable. 



3.3.3 Frequency Method 

The frequency method consists of searching the frequency co and the maximum 
gain Kcm of fho proportional controller such that the characteristic equation 
is equal to zero for these values (see Chap. 5). The analysis is realized in the 
complex plane and it is sufficient to set 

s = j u (3.61) 

Example 3.6: Stability Analysis in Frequency Domain 

Take again the previous Example 3.3 where the open-loop transfer function is 
equal to 

g 

Gol = GcGaGMm = Gc-, T 7 W 7 r (3.62) 

^ (5 + 1)(5 + 4)(5 + 6) ^ ^ 

with Gc = Kc (the controller is set in proportional mode) and the characteristic 
equation to be solved becomes 

{s + 1) (5 + 4) (5 + 6) + 8 Kq = 0 

4) (j cj + 6) + 8 Kc = 0 
(-llcj2 + 24 + 8 Ac) + j(-cj^ + 34cj) 
r -llcj2 + 24 + 8Acm =0 
\ -u;^ +34cj = 0 

from which we draw the frequency co = 5.831 radians/time unit and the maxi- 
mum gain of the proportional controller Kcm = 43.75, beyond which the system 
becomes unstable. For this limit gain, the system subjected to a set point step- 
like variation would present an output showing sustained oscillation. 
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Chapter 4 

Design of Feedback 
Controllers 



Until this chapter, the problem of getting the best values for the controller pa- 
rameters so as to get the “best” possible response was not our objective. In this 
chapter, solutions will hrst concern classical PID controllers. The important 
problems are: 

• The choice of the controller type. 

• The tuning of the controller parameters. 

• The performance criteria to be used. 

After the PID controller, more sophisticated control methods will be explained 
such as internal model control, pole-placement control and linear quadratic 
control. 



4.1 Performance Criteria 

The controller role for the closed-loop system is to guarantee that the response 
presents suitable dynamic and steady-state characteristics (Kestenbaum et ah, 
1976). 

The following criteria can be cited: 

• The controller must be able to maintain the controlled variable to its set 
point. 

• The closed-loop system must be asymptotically stable and present a satis- 
factory performance in a large range of frequencies. 

• The influence of disturbances must be minimized. 

• The responses to set point variations must be fast and smooth. 

• An excessive control must be avoided (the control variable u must not un- 
dergo too frequent large and fast variations). 

• The control system must be robust: it must be insensitive to process varia- 
tions and modelling errors. 
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In reality, all these objectives cannot be simultaneously realized and the control 
system (Fig. 4.1) results from a compromise. For industrial applications, the 
robustness is particularly important. 



External 

Disturbances 




Figure 4.1: Summary of the problems of a controller design for process control 

The more frequently used criteria (cf. Fig. 1.29) are: 

• The overshoot. 

• The rising time. 

• The stabilization time. 

• The decay ratio. 

• The oscillation frequency of the transient. 

Different methods are possible: 

• “Direct synthesis” method. 

• Internal model control. 

• Pole-placement. 

• Optimal control. 

• Tuning relations. 

• Frequency response techniques. 

• Computer simulation based on knowledge models. 

• On-site tuning. 

In the context of this chapter, the first five methods are based on continuous 
transfer function models or Laplace polynomials; frequency response techniques 
can be used for any linear model. Optimal control based on state-space models 
will be studied in Chap. 14. Computer simulation allows us to use any type of 
model, in particular knowledge models based on fundamental principles, but is 
longer in this case and needs a lot of human investment. 
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4.2 Transient Response Characteristics 



The characteristics of the transient response depends on the position of the 
closed-loop transfer function poles. The dominant poles which are the closest 
to origin are retained. When only a given pair of complex poles is retained, 
the reduced system has the characteristics of a second-order system with the 
following closed-loop transfer function 



G{s) 



uo 



2 

n 



Uj‘^ T 2 ^ UJfiS + 5^ 



(4.1) 



with damping factor ( and natural frequency On the left half 5-plane, 
corresponding to stable poles, different features can be drawn (Fig. 4.2) for a 
step response: 






The half straight lines of constant overshoot corresponding to relation (1.157) 
hence 



y = ± — T — ^ 



with: = constant < 1 



(4.2) 



the damping factor must not be too low; the first bisecting line corresponds 
to value C = \/2/2. 

• The half circle of constant rising time corresponding to relation (1.160) 
hence 

2.5 cy cy cy 

tjn ^ constant x y ~ with: x <0 (4.3) 

UJn 



• The straight line of constant stabilization time tg corresponding to relation 
(1.162) hence 



tg ~ ~ constant x = —CcOn ~ (4.4) 

Figure 4.2 allows us to evaluate the infiuence of the pole position on these three 
frequently used criteria: the hatched zone must be avoided. 

Moreover, in general, the real part of the poles must not be too small (its 
absolute value must not be too large), because in the opposite case, it would 
correspond to a very fast response but would probably have the consequence 
of saturating the manipulated variable. 



4.3 Performance Criteria for Design 



Denoting as an error the difference e{t) = y{t) — yr{t) = (output — set point), 
the criteria dealing with the integral of an error function take into account the 
global nature of the process response. Several criteria exist: 

• Integral of the square of the error (ISE) 



ISE 




dt 



( 4 . 5 ) 
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Constant 

stabilization 




Figure 4.2: Loci of constant overshoot, rising time and stabilization in the 
5-plane. The hatched zone is not convenient for the poles 



• Integral of the absolute value of the error (lAE) 

pOO 

IAE= / \e{t)\dt (4.6) 

Jo 

• Integral of the absolute value of the error weighted by time (ITAE) 

pOO 

ITAE= / t\e{t)\dt (4.7) 

Jo 

The problem is then to choose the controller type and its parameters so as to 
minimize one of the previous criteria. The criteria can be ordered with respect 
to their own characteristics: 

• Use of lAE gives well-damped systems; for a second-order system, the result- 
ing damping factor will be around 0,7 (Shinners, 1992) nearly as for ITAE. 
With ISE, it would be around 0,5. ISE is not very sensitive to parameter 
variations, as opposed to ITAE which is sensitive. 

• To suppress large errors (numbers >1), ISE is better than lAE because the 
error term intervenes by its square. 

• To suppress small errors (numbers < 1), lAE is better than ISE. 

• To suppress errors which persist a long time, ITAE is the best criterion 
as the t term amplifies small persisting errors. This is often the preferred 
criterion because it offers more security. In general, it will produce smaller 
overshoots and oscillations. 
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Graham and Lathrop (1953) have searched the best closed-loop transfer 
functions with respect to the ITAE criterion. Indeed, it is possible to consider 
several types of transfer functions Gd'. 

— When the numerator is a constant, the transfer function has no zeros: this 
corresponds to an error of position which is zero for a step input. The 
denominator will take one of the forms given by the top array of Table 4.1. 

— When the numerator degree is 1, the transfer function possesses one zero. 
It will be designed to ensure a zero position error and a zero velocity error 
for a ramp trajectory. The denominator will take one of the forms of the 
middle array of Table 4.1. 

— When the numerator degree is 2, the transfer function possesses two ze- 
ros. It will ensure a zero acceleration error for parabolic trajectories. The 
denominator will take one of the forms of the bottom array of Table 4.1. 

It must be noted that the higher the demand with respect to the transfer 
function Gc/, the more zeros it possesses and the more its transient behaviour 
can itself deteriorate (Fig. 4.3). 



Table 4.1: Denominators of the optimal transfer functions for the ITAE 
criterion and for systems with zero position error, zero velocity error or zero 
acceleration error 



Denominator of transfer function Gd optimal for ITAE 
For a system with zero position error 

Gd{s) = oJq / (s^ + ^ + . . . + ais + cJq ) 

s -\- UJq 

5^ T \ T cjq 
T 1.75cjq'^^ T 2.15cJq5 T oJq 
5^ T 2.1cjq'^^ T 3.4cJq5^ T 2.7cJq5 T uJq 
T 2.8cjq'^^ T 5.0cjq5^ T 5.5cjq5^ T 3.4cjq5 T cjq 
+ 3.25cjqs^ + + S.Gcu^s^ + 7 + 3.95cj^s + cjg 

For a system with zero velocity error 
Gd{s) = (ais + )/ ^ + . . . + ais + cJq ) 

5^ T 3.2cjq'^ T ^0 
T 1.75cjq'^^ T 3.25cjq5 T cjq 
5^ T 2.41cjq'^^ T 4.93cjq5^ T 5.14cjq5 T cjq 
5^ + 2.19cJo5^ + + 6.30 cj^52 + 5.24cj^5 + 

+ 6.12cjqs^ + 13.42cjgs^ + 17.16cjgs^ + U.Uuu^s^ + 6.76uj^s + cjg 
For a system with zero acceleration error 

Gd — (^2 + CJq ) / ( 5 ^ + ^ + . . . + ai5 + UJq ) 

T 2.97cjq'^^ T 4.94cJq5 T oJq 
5 ^ T 3.71cjq'^^ T 7.88cjq5^ T 5.93cjq5 T cjq 
5 ^ + 3.81cJo5^ + + 13.44 cj§52 + 7.36cj^5 + 

+ 3.93cjqs^ + 11.68cjgs^ + 18.56cjgs^ + 19.3Qcj^s^ + + cjg 
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In Table 4.1, the value of cjq is not given. It depends on the desired dynamics 
for the system; in some cases, it can be determined with respect to constraints 
existing on the actuator: iXniin ^ u{t) < iXmax- 




Figure 4.3: Responses to a step input for optimal transfer functions with 
respect to the ITAE criterion in the case of a zero position error. The number 
on the figure indicates the transfer function order 



4.4 Choice of PID Controller 

4.4.1 General Remarks 

Up to now, a given number of generalities can be drawn: 

• Proportional action: 

— Accelerates the process response by gain increase. 

— Produces a steady-state deviation except for processes which have an 
integrator term (l/s) in their transfer function. This offset decreases 
when the proportional gain increases. 

• Integral action: 

— Eliminates the steady-state deviation. 

— This elimination is done in general at the expense of larger deviations. 
— Responses are sluggish, with long oscillations 

— The increase of gain K makes the behaviour more oscillatory and can 
lead to instabilities. 

• Derivative action: 

— Anticipates future errors. 

— Introduces a stabilizing effect in the closed-loop response. 




Process Control 



135 



4.4.2 Recommendations 

Simple Rules 

1) If possible, use a P controller: if the steady-state deviation is tolerable, or if 
the process contains an integral term. Example: gas pressure control or level 
control. 

2) If the P controller is unacceptable, use a PI: if the steady-state deviation is 
too large. Example: flow rate control. In this case, the response is fast and 
the slowing induced by integral action is unimportant. Astrdm and Hagglund 
(1988) recommend using a PI controller for processes that have hrst-order dy- 
namics, e.g. level control in a tank. 

3) In other cases, use a PID: the closed-loop response will be faster and the 
controller will be more robust. Example: temperature control, composition 
control, processes with capacities in series. Astrom and Hagglund (1988) rec- 
ommend using PID controllers for processes having second-order dynamics, 
which sometimes may be difficult to detect, or having time constants of dif- 
ferent orders of magnitude. Owing to the derivative action, the gain can be 
limited. 

4) Typical systems posing serious problems for PID are: 

— Systems with time delay. 

— Systems with oscillatory modes. 

— Systems with large parameter variations. 

— Systems for which a quality variable should be controlled. 



More Detailed Discussion 

This advice is valid for use in the case where there is no available process model; 
they do not constitute instructions for use without reservation and will always 
need to be used with care. 



Flow rate control. Control feedbacks for flow rate and liquid pressure are 
characterized by fast responses so that the delays are, in general, negligible. 
The sensor and the pneumatic transmission lines can introduce a time delay. 
Disturbances are, in general, high-frequency noise, which makes the derivative 
action unusable. PI controllers are frequently used. 



Liquid level control. Integral action is not necessary if a small deviation 
(around 5%) is tolerated at the level. If integral action is used, high gains 
can be chosen due to the integrating nature of the process. In general, the 
derivative action is not used. In many cases, a buffer tank is used to avoid level 
fluctuations in the plant. In this case, the outlet flow rate of the tank must be 
as stable as possible and the controller will have to be carefully tuned. 

When heat transfer occurs in the tank (reboiler, evaporator), its operating 
model is more complicated and the controller will be different. 
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Gas pressure control. If the gas is in equilibrium with a liquid, gas pressure 
control is difficult. Here, pressure control is considered for a gas alone. The 
tank (or the pipe) partially controls itself: if the pressure inside the system 
becomes too high, the feed flow rate decreases, and vice-versa. In general, PI 
controllers are used with a small integral action (r/ large). Often, tank volumes 
are small so that the residence times are low with respect to the rest of the 
process and derivative action is not necessary. 

Temperature control. Temperature control problems are complicated and 
of a large variety with respect to the considered system. Frequently, because 
of occurring time delays, the gain must not be too large to avoid instability. 
PID controllers are often used to have a faster response than with a PI and to 
stabilize the process. 

Composition control. Several points are common with temperature con- 
trol, but two additional factors intervene: 

— The measurement noise is more important. 

— Time delays can be very large (example: chromatographs). 



4.5 PID Controller Tuning 

In many cases, the technician or engineer faces an existing plant and must do 
on-site tuning. 

4.5.1 Tuning by Trial and Error 

A typical on-site tuning of a PID controller (later, the connection with the 
Bode stability criterion (Sect. 5.5) and Ziegler-Nichols tuning will appear) is 
done as follows: 

• Stage I: The controller is set in proportional mode only (it suffices to take 
T/ maximum and td minimum). 

• Stage 2: A low value of the gain is chosen and the controller is set in 
automatic mode. A step set point variation is operated. 

• Stage 3: The gain is increased by small increments until a sustained oscil- 
lation of period is obtained (corresponding to the ultimate gain Kcu)- 

• Stage 4: The gain is reduced by a factor of 2. 

• Stage 5: r/ is decreased by small increments until a sustained oscillation is 
again obtained, r/ is set to three times this value. 

• Stage 6: td increased until sustained oscillation is obtained, td is set to 
one third of this value. 

During this operation, it is necessary to avoid saturating the controller output. 
Otherwise, oscillations can occur at gain values lower than the ultimate gain 
Kcu- In principle, when the gain is lower than the ultimate gain, the closed-loop 
response Ua{t) (the controller output) is underdamped or weakly oscillating. 
When the gain is larger than the ultimate gain, the system is unstable and the 
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response is theoretically unbounded. As a matter of fact, saturation will occur 
very often. 

Conclusion: the ultimate gain Kcu is the highest value of the controller gain for 
which the system is closed-loop stable, the controller being only in proportional 
mode. 

Drawbacks: this procedure can be unsafe, e.g. it can lead to reactor runaway. 
Moreover, it cannot be applied to processes which are unstable in open loop. On 
the other hand, some processes have no ultimate gain: e.g. a process perfectly 
modelled by a hrst- or second-order transfer function and having no time delay. 
Nevertheless, the rules which are given clearly show the influence of each action 
and provide a line of conduct: when the tendency to sustained oscillations is 
detected, a safety margin is adopted for the controller parameters. 



4.5.2 Sustained Oscillation Method 

The method presented in the previous section is only a variant of the contin- 
uous cycling method by Ziegler and Nichols (Ziegler and Nichols, 1942). This 
method consists of determining as previously the ultimate gain. The sustained 
oscillation period is called the ultimate period T^. Then the tuning recom- 
mended by Ziegler and Nichols can be used (Table 4.2). Furthermore, this 
only constitutes a hrst tuning which must be rehned. 



Table 4.2: PID tuning recommended by Ziegler and Nichols 



Controller 


Kc 


Tl 


Td 


P 


0-5 Kcu 






PI 


0A5Kcu 


Tu/l.2 




PID 


0.6 Kcu 


T„/2 


Tu/8 



It can be noticed that the gain of the PI controller is lower than that of the P 
controller, and that of the PID controller is the highest, thus corresponding to 
the general tendencies of the integral and the derivative actions. 

Indeed, these empirical settings based on a decay ratio equal to 0.25 are 
not necessarily the best; they cause a non- negligible overshoot, as a decay ratio 
equal to 0.25 gives a damping factor ( = 0.22, which is too low. Different 
settings are proposed (Perry, 1973) for PID controllers in Table 4.3 which 
produce less overshoot but may, however, not produce the foreseen results. 

The tuning “without overshoot” corresponds of course to the lowest con- 
troller gain, but a small overshoot is frequently tolerated. 

It may happen that one type of tuning is convenient for a set point variation 
but is less convenient when the system is subjected to a disturbance. 
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Table 4.3: Tuning of a PID giving less overshoot than Ziegler and Nichols tun- 
ing. Note that these recommendations by Perry (1973) do not always provide 
the expected results 



Controller with 


Kc 


Tl 


Td 


Light overshoot 
No overshoot 


0.33 Kcu 

0.2 Kcu 


Tu/2 

Tu/2 


T„/3 

T„/3 



4.5.3 Relay Oscillation Method 

The method of sustained oscillation in the neighbourhood of instability is 
favourably replaced by the observation that a relay interposed in the loop be- 
fore the actuator (Fig. 4.4) allows to generation of a control signal having the 
shape of a square wave, which leads to an oscillating measured output with the 
critical or ultimate period Tc. Indeed, the relay works as an on-off controller. 
The only influence of the proportional controller of Fig. 4.4 is on the relay 
excitation signal. Consequently, its gain is chosen to be equal to 1 (as if the 
proportional controller was absent) or a value which does not prevent the relay 
from oscillating. Moreover, the output oscillation amplitude is proportional to 
the relay amplitude and can be chosen to be small so as to little perturb the 
process. Thus this method is clearly more admissible in a real process than the 
trial and error tuning method, as the system continues to run in closed loop 
in a small neighbourhood of its set point (Astrom and Hagglund, 1988; Voda, 
1996). It is possible to use a relay without or with hysteresis, the latter being 
less sensitive to the noise and therefore likely to make a relay without hysteresis 
swing. 



Relay 




Figure 4.4: Use of a relay to provoke system oscillations for tuning (Astrom 
and Hagglund, 1988). Note that in the presence of the relay, the proportional 
controller Gc has a gain of 1 so that the controller has no influence 



Consider the case of a relay with hysteresis (also called dead zone relay). In 
the presence of the relay, the process output enters in a limit cycle that is to 
be characterized with respect to its amplitude and frequency (Coughanowr and 
Koppel, 1985). The error signal e{t) = yrm{t)—ym{t) (Fig. 4.5) is approximated 
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by a sinusoidal function equal to 

e(t) ^ A sin(cjt) (4.8) 

A perfectly sinusoidal error signal would correspond exactly to an ellipsoidal 




Figure 4.5: Error signal. 



limit cycle. 

Consider a relay with hysteresis whose state changes either by passing from 
a value yrei,max to a value yrei.min when the error signal which excites it crosses 
a small positive value e by becoming lower than e or by passing from a value 
yrei.min fo a value yrei,max wheu the error signal crosses the value (— e) by 
becoming larger than (— e). The action of this relay can be symbolized by Fig. 
4.6. 

With respect to the error signal e(t), the rectangular wave of the relay 
symbolized by Fig. 4.6 is a signal in advance by l/cjarcsin(e/A) corresponding 
to the hysteresis. This rectangular wave yrei{t) of period T = can be 

described by means of the periodic function /(t) with no advance 



yvei(t) = f{t + l/cjarcsin(e/A)) , with: f{t) = ^ 

This function f{t) can be expanded as a Fourier series as 



yrel,'. 



yrel,min 

I 



T 

if: 0 < t < — 
fit) V t 

(4.9) 



f{t) — iUmax ymin) ^ ^ 



fe =0 



(2k + 1)7T 



sin((2/c + 1) cj t) 



(4.10) 



and the rectangular wave of the relay becomes 

^ 2 



yrel(t) — (yraax ymin) ^ ^ 



k=0 



(2k + 1)7T 



sm((2k + 1) cj (t + l/cjarcsin(e/A))) 

(4.11) 
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Figure 4.6: Action of relay with hysteresis excited by the error signal e{t) 



which can be approximated by its first harmonics, as the influence of the other 
harmonics is weighted by lower coefficients and is largely filtered by the other 
elements of the feedback loop 

2 

yrei{t) W {ymax ~ ymin) ~ sin(o; {t + l/w arcsin(e/yl))) (4.12) 

7T 

Set the relay amplitude au = {ymax ~ ymin) I Thus, the relay acts essentially 
through its first harmonics of amplitude da^^/TT on the closed-loop process whose 
measured output oscillates with an amplitude ay and a period Tc according to 
a limit cycle (Fig. 4.5). The relay which has the error e{t) as an input and 
yrei (t) as an output is described by a nonlinear function, which is indeed a pure 
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gain Grei{ciu) depending on the amplitude Gu of the rectangular wave of the 
relay 

Orel = — (4.13) 

TTGy 

The condition of oscillation of the limit cycle is 

G reliflu) Gpr^Q^i^juJc) = 1 (4.14) 

with: Gproc{s) = Ga{s)G{s)Gm{s)^ the critical pulsation ujc being equal to 




In chapter 5, ujc is called the phase crossover frequency and Tc is denoted 
by the ultimate period Tu. 

From the limit cycle condition, the process transfer function at the critical 
pulsation is deduced 

irn 

GprocUuJc) = -T^ (4.16) 



The ultimate gain Kcu of the proportional controller is thus the reciprocal of 
the modulus of Gbo(j^c) 

Kcu = — (4.17) 



The condition of oscillation is that the curves Gproc(j^) and —IjGrei intersect 
in the complex plane, which corresponds to a Nyquist diagram. 

From the knowledge of the ultimate period Tc = Tu and the ultimate gain 
Kcu^ the initial tuning of the controller can be performed following Ziegler- 
Nichols tuning recommendations (Table 5.2). 

This relay technique has been improved and extended for processes with 
delay (Scali et ah, 1999) and multivariable processes (Semino and Scali, 1998). 

Example 4.1: Use of the Relay Oscillation Method 

A third-order system presents an oscillation-critical frequency. Consider the 
same system as in Sect. 3.3.3 which contains an actuator represented by a 
hrst-order Ga = 1/(5 + 1), a process represented by a second-order Gp = 
1 /( 5 ^ + IO 5 + 24) and a transmitter that has a pure gain equal to 8. In the 
absence of a relay, the controller is set in proportional mode with a gain equal 
to 30. This gain does not give a zero steady- state deviation, but with a slightly 
higher gain, the process becomes unstable. Then, a proportional controller is 
taken with unity gain (if the gain was left equal to 30, the result would be 
very close) and the system is subjected to a step unit since t = 0 and the relay 
placed before the actuator is operated at the same time. The relay possesses 
the following characteristics: its state changes from closed to open for an input 
value taken as 0.001 (a small value corresponding to a zero error signal) and 
from open to closed for the same value (taking 0.001 or —0.001 as in the previous 
theory would have very little influence), while the value corresponding to the 
actuator input around the steady state is about 22. When the relay state is 
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Figure 4.7: Responses obtained in the case of the use of a relay to make a 
third-order system oscillate. Top curve: relay input. Middle top curve: relay 
output. Middle bottom curve: process output y with set point. Bottom curve: 
measured output ym 



closed, its output is open to 38; when its state is open, its output is equal to 
6, so that the amplitude of the rectangular relay wave is equal to 16. It is thus 
observed that it is possible to make the system oscillate around a given state, 
which is indeed its set point (Fig. 4.7) with a frequency ujc = 27t/T^, in the 
present case: uJc = 27t/ 1.096 ~ 5.73 rad/time unit. The frequency thus found 
is near the critical frequency found in Sect. 3.3.3. Because of the nonlinearities 
introduced by the relay and the slightly different proportional gain, it is not 
exactly the same. 
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The amplitude of the relay output is = 16 while the amplitude of the 
measured output is ay ^ 0.473. Thus, the ultimate gaiu of the proportioual 
coutroller giveu by Eq. (4.17) is K^u ~ 43.07, close to the exact value fouud iu 
Sect. 3.3.3. Kuowiug the ultimate period aud the ultimate gaiu Kcu’, the 
iuitial tuuiug of the coutroller cau be easily performed followiug Ziegler-Nichols 
tuuiug recommeudatious (Table 5.2). 



The state arouud which the oscillatious occur correspouds to the steady state 
provided that a sufficieutly high proportioual gaiu is choseu to avoid the steady- 
state offset. 

Theoretically, ouly systems of order larger thau 2 or haviug a time delay 
cau possess a phase augle which cau become lower thau —180° above some 
frequeucy. Iu practice, uearly all systems preseut a time delay, which eveu cau 
be low, aud thus cau oscillate iu this mauuer. 

4.5.4 Process Reaction Curve Method 




Figure 4.8: Opeu coutrol loop for Coheu aud Coou method 

To get the process reactiou curve, Coheu aud Coou (1953) recommeud opeuiug 
the coutrol loop (Fig. 4.8) betweeu the coutroller aud the actuator aud im- 
posiug a step ou the actuator iuput Ua- Betweeu the Laplace trausform of the 
measured output Ym aud that of the actuator iuput Ua^ the trausfer fuuctiou 
correspoudiug to the process reactiou curve is 

Gprc = = Ga Gp Gm ( 4 - 18 ) 

Pa 

Thus, the dyuamics of the measured output depeuds uot ouly ou the process, 
but also ou the actuator aud ou the measuremeut device. These three elemeuts, 
process, actuator aud seusor, coustitute the physical euviroumeut which cauuot 
be dissociated to get a measuremeut value with respect to a giveu actuator 
positiou. 
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The measured responses ym frequently have a sigmoidal allure observed in 
Fig. 4.9. This curve is approximated by a curve corresponding to a first-order 
system with delay transfer function (Fig. 4.9) 



^ ^ ^ KeM-tds) 

Ua T5+1 ^ ‘ ^ 

Thus, three parameters must be estimated from the experimental curve: 

• K = Bj A with B as the steady- state output, and A as the amplitude of the 
input step 

• r = BjS with S as the slope of the sigmoidal response at the inflection point 
(note that this determination of S is imprecise and results in an approximate 
value of r). 

• td time gone before the system response 

One thus obtains the curve approaching the sigmoid. This method is also 
called Broida’s method. If r is difficult to estimate practically, it can be well 
approximated by the residence time Tres 



Bes 




G'jO) 

G(0) 



(4.20) 



where g{t) is the impulse response of a stable process having the transfer func- 
tion G{s). 

It would be possible to choose an approximate model of a delayed second- 
order transfer function, which would provide a better approximation 



_Ym ^ K exp{-td s) 
Ua ^ (ns + 1)(t2S + 1) 



with: Ti > T2 



(4.21) 



The gain K and the delay td are obtained in the same manner as previously 
discussed. The time constants ri and T 2 can be identified by a least-squares 
method or approached by the moments method or Harriott’s method. 

Then the parameter values of the approximation transfer function must be used 
to deduce the tunings of different controllers. Assuming a delayed first-order 
transfer function, to get the below-mentioned values, Cohen and Coon (1953) 
have used the following criteria: 

• One quarter decay ratio. 

• Minimum steady- state offset. 

• Minimum integral of square error (ISE). 

They obtained the following recommended tunings: 
a) Proportional controller: 



Kc 



I r 

K Td 




(4.22) 



b) PI controller: 



Kc 



I r 

K Td 



0.9 + 



td 

I2r 



(4.23) 
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Figure 4.9: Parameter determination according to the process reaction curve 
method recommended by Cohen and Coon (sigmoidal response: curve a, ap- 
proximation by a hrst-order transfer function with time delay: curve b) 



c) PID controller: 



30 + 3td/r 
~ 9 + 20 td/r 


(4.24) 


■ - IL f^ + ^'] 

K td 0 4rJ 


(4.25) 


32 + 6td/r 
~ “^U + Std/T 


(4.26) 


~^\l+2td/T 


(4.27) 



These values must not be considered as hnal values, but as initial values of con- 
troller tuning, in particular when the response given by the hrst-order system 
with time delay goes far from the open-loop system response curve. According 
to De Larminat (1993), a PID controller gives excellent results when the ratio 
T jtd is large (larger than 5 or 10). If td > 0.5r, i.e. the delay is relatively large 
compared to the process-dominant time constant, it must be considered that a 
simple PID controller is not appropriate. 

The following generalities valid for other tunings and relative to P, PI and 
PID controllers can be observed: 

• The gain of the PI controller is smaller than that of the P controller (because 
of the tendency of integral action to destabilize the process). 

• On the other hand, the gain of the PID controller is larger than those of the 
P and PI controllers, because the derivative action stabilizes the process. 
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4.5.5 Tuning rule of Tavakoli and Fleming for PI con- 
trollers 



The tuning of PI controllers for first order systems with delay proposed by 
(Tavakoli and Fleming, 2003) guarantee a minimum gain margin of 6dB and a 
minimum phase margin of 60° even when the delay td is large with respect to 
the time constant r of the system. The rules are 



KKc= 0.4849 —+ 0.3047 

T 

— = 0.4262 — +0.9581 

T td 



(4.28) 



where K is the gain of the open-loop system. This method gives results close 
to that proposed by (Hang et ah, 1991). 



4.5.6 Robust tuning rule for PID controllers 

(Astrom and Hagglund, 2004) have conducted many studies on PID control. 
Recently, they proposed approximate M-Constrained integral gain optimiza- 
tion (AMIGO) to increase the robustness expressed through the parameter M. 
Their PID controller presents a formula slightly different from the classical PID 
as 



r{t) = K, 



1 r 1 

{byr{t) - yf{t)) H / (y^(x) - yf{x))dx H 

Ti Jo Td 



1 ( J,yr{t) 



dt 



where r{t) is the output signal from the controller, the set point, 
filtered output given by the filter transfer function 






1 



F/(^) 

y(s) {Tfs + 1)2 



or: Gf{s) = 



1 



TfS + 1 



dt ) 
(4.29) 
yf the 



(4.30) 



The second-order filter is chosen to increase the filtering action. Parameters 
h and c in equation (4.29) are called set point weightings which influence the 
response to set point changes. The controller thus designed with set point 
weighting has two degrees of freedom: set point and disturbance responses 
are both taken into account. For a process described by a first-order transfer 
function with delay such as (4.19), the relative dead time is defined as 



td^r 

and the AMIGO tuning rules for a PID controller are 

OAtd + 0.8r 

Tl = td , , n _ 



(4.31) 



(4.32) 



(4.33) 
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0.3 td + r 



(4.34) 



When Tr > 0.3, these rules give good results. For lower values, the gain Kc 
must be increased, r/ decreased and td increased. However, the tuning is 
robust for all processes. A conservative choice of parameter b is 



0 for r < 0.5 

1 for r > 0.5 



(4.35) 



while in general c = 0 except for smooth set point changes. 

For an integrating process represented by the transfer function 






K exp(-tds) 



the AMIGO tuning rules for a PID controller become 
0.45 



K, = 



Tl =8 td 



Td = 0.5 td 



(4.36) 



(4.37) 



4.6 PID Improvement 



As opposed to the ideal PID, a real PID, as already discussed in Sect. 2.3.4, 
with transfer function Gc{s) 



Gc{s) = K, 



TjS + 1 

TjS 



TpS + 1 
(3tds + 1 



or slightly modihed transfer function Gc{s) 



(4.38) 



^c(^) = K, 



1 + — + 
TjS 



TpS 



IR. 

N 



5 + 1 



(4.39) 



should be considered as the minimum PID controller implementable on a pro- 
cess. 

With respect to these versions, variants of PID controllers have been pro- 
posed in order to solve some problems which can occur in the use of classical 

PID. 



4.6.1 PID Controller with Derivative Action on the 
Measured Output 

It is often preferable (Astrom and Hagglund, 1988; Wolovich, 1994) to operate 
the PID controller by making the derivative action act no more on the error 
coming from the comparator but on the measured output (Fig. 4.10), under 
the theoretical form 

Ua{t) = Kc ^e(t) + ~ J e(x) dx - Td 



(4.40) 
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or practically 



Ua{s) = K, 



1 + — ) E{s) - 

TiS 



KcTp S 
m o I 1 






(4.41) 




Figure 4.10: Block scheme of the PID controller with derivative action on the 
measured output 

This manner of taking into account the derivative action enables us to avoid 
steep changes of the controller output due to the error signal variation during 
set point changes. This controller will be called PID with derivative action on 
the measurement. The theoretical derivative action is filtered by a first-order 
system so that the actual derivative action acts especially on the low-frequency 
signals; the high-frequency measurement is, at the most, amplified by coefficient 
N. 

4.6.2 Use of a Reference Trajectory 

A violent set point change, for example step-like, induces fast and important 
variations of control variable ix, which can be mechanically harmful to the 
actuator or which can lead the control to saturation. In order to minimize the 
brutal effect of set point change, it is possible to filter the set point by a unit 
gain first-order or overdamped second-order filter Gref (Fig. 4.11) so that the 
output y will be compared to a smoothed set point yref^ called a reference 
trajectory. In Fig. 4.12, it is easy to note that: 

— In the case of the use of this second-order reference trajectory, the output 
reacts more slowly, but does not present any overshoot anymore. 

— The control variable varies much less brutally and in a narrower range. 

4.6.3 Discretized PID Controller 

In a system controlled by a computer, measurements are made at discrete 
times tk such that two successive instants are separated by a sampling period 
Tg. Without entering into the details of the choice of the sampling period (Ta- 
ble 9.3), it is useful here to mention the incremental form of a PID controller 
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Figure 4.11: Control with reference trajectory 




Time (s) 



Figure 4.12: Upper figure: response to a set point unit step without reference 
trajectory (curve a), middle figure: Reference trajectory and response to a set 
point unit step with reference trajectory (curve b), lower figure: corresponding 
controls. Process {Kp = 2,r = 5, (“ = 0.5) ; real PID controller (Kc = 10, r/ = 
5, Ti:) = 4, /3 = 0.1); second-order reference trajectory {K = 1, r = 0.5 , (“ = 1) 



(Astrom and Hagglund, 1988). From a given continuous form, several forms 
can be obtained according to the type of discretization (backward difference, 
forward difference, etc.). The Tustin algorithm, which gives the nearest form to 
the continuous controller is used in the form described below (Astrom and Hag- 
glund, 1988). The control is expressed by its variation between two successive 
instants 



Au{tk) = u{tk) - u{tk-i) = AP{tk) + AI{tk) + AD{tk) (4.42) 

where D represent the following proportional, integral and derivative 

contributions 



AP{tk) — P(tk) — iUrditk) Umitk) Urditk — l) T llmitk — l)) 

(4.43) 

The variable tjrd (reference) is equal to the set point tjr if no reference trajectory 
(set point filtering) is used; otherwise, it is equal to yref^ filtered set point or 
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reference trajectory. Astrom and Hagglund (1988) propose to take yrd as 

Vrd = byr ; 0<6<1 (4.44) 

where b is simply a constant. The choice 5 = 0 gives a sluggish response without 
overshoot, 5=1 (no filtering) gives a fast response with a possible overshoot. 
The integral term is equal to 

AI{tk) = I{tk) - - ym{tk-l)) (4.45) 

Tl 

Note that it would have been possible to replace yr by yrd- 
The derivative term is equal to 



AD{tk) = D{tk) - D{tk-i) 



{ym(tk) ^ ym(tk — l) T ?/m(^/c — 2 )) (4.46) 



with the coefficients a and 5 equal to 



2td — Ts N ^ 2 Kc N td 

2t]j Tg N 2 t]j Tg N 



(4.47) 



such taht 

D{tk) =aD{tk-i) + 5 {y{tk) - y{tk-i)) (4.48) 

It is necessary to choose td > NTg/2 (otherwise a < 0). The prescribed form 
amounts to filtering the ideal derivative action by a first-order system with time 
constant td/N. If N is chosen to be large, the high-frequency measurement 
noise is amplified. 



4.6.4 Anti-Windup Controller 

For the purpose of anti-windup, a controller based on a real PID controller with 
derivative action on the measured output is used. 

The actuator presents physical limitations (a valve position is situated be- 
tween full opening and full closure), so that when the control variable imposes 
the actuator position at its limits, the feedback loop becomes inefficient as the 
control is saturated and cannot vary anymore. In this case, the error becomes, 
in general, important and the integral term even more so. The integrator is 
subjected to windup; this is also called integral saturation. The solution is to 
stop the action of this integrator as long as saturation lasts. To avoid the unde- 
sirable windup phenomenon, it is possible to add an additional feedback loop by 
using the difference between the controller output and the actuator model 
output (Fig. 4.13) (Astrom and Hagglund, 1988; Hanus, 1990, 1996; Hanus 
and Peng, 1992; Hanus et ah, 1987). In the absence of actuator saturation, this 
difference is zero. When the actuator saturates, the proposed anti-windup 
system acts to try to bring back this difference towards zero. The higher 
the loop gain (the time constant is low), the faster this system reacts. 

The anti-windup system has been tested in the following configuration: 
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Figure 4.13: PID controller with anti- windup device 



— The process is an underdamped second-order system presenting the follow- 
ing characteristics: Kp = 2, = 5, (“ = 0.5. 

— The actuator saturates when the input is out of interval [—1, 1]. 

— The used controller is a real PID with derivative action on the measured 

output: iCc = 30 , T/ = 2 , Ti:) = 2 , = 5. 

— The system is subjected at time t = 1 to a set point unit step. 

In the absence of an anti- windup system (it suffices to choose gain l/r^ zero), 
the actuator saturates during long periods, both on the positive and nega- 
tive sides, which provokes a large overshoot of the output with respect to the 
set point (Fig. 4.14). When the anti- windup system is installed with gain 
1/rt = 10, the actuator still saturates at the beginning, but during a much 
shorter period, and the output joins the set point back smoothly with very 
little overshoot (Fig. 4.14). 

It can be noticed that, with an integral gain very slightly higher (with 
T/ = 1,5), when the anti- windup system does not work, the system is near 
instability. If the anti-windup system is operated, this problem disappears. 
Similarly, by imposing still more severe constraints on the actuator, it becomes 
very difficult, or even impossible, to function without anti-windup system, while 
with the latter, the process mastering is realized without difficulty. 

4.6.5 PID Control by On— Off Action 

In many processes, it may happen that the actuator offers only two possibilities: 
on or off. Such a case occurred in a lab crystallizer, where heating is achieved 
by electrical resistance and the only possibility is to switch the electric current 
on or off. If a simple controller is used, it will result in important oscillations 
around the set point, even more so as the process is highly nonlinear because 
heating is performed by radiative heat transfer while cooling is, in general, 
performed by a coil or a jacket or even natural convection for an oven for 
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Figure 4.14: Top figure: input u and output y in response to a set point unit 
step without anti- windup system (l/r^ = 0), lower figure: input u and output 
y in response to a set point unit step with anti-windup system (l/r^ = 10). 
Process (iCp = 2,r = 5,(“ = 0.5); real PID controller (iCc = 30, r/ = 2^td = 
2, AT = 5) 

example. 

The following strategy allows us to deal with such a case. First, a base 
period is chosen and this period is divided into n sub-intervals tc = tb/n, 
which will be the control period: the actuator will be allowed to change at each 
new sub- interval. 

An integer Uh takes the value 1 if heating is desired, 0 if no heating. 

An internal counter inter is added, which counts from 0 to n. When the 
counter reaches n, it is reinitialized to 0. The counter advances by one unit at 
every tc- A digital clock gives the time td- The counter is equal to 

inter = mt(td/tc) — int(td/{tcn)) n (4.49) 

where “int” indicates the integer part of a variable. 

A positive maximum error emax is given by the user. It corresponds to a 
very large error due to underheating at the beginning of operation for which 
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we can accept that the heating is on during all A positive minimum error 
Cmin is also giveu. It corresponds to the absolute error due to such overheating 
that the heating must be switched off. 

The error e = T^ — T between the reference temperature and the crystallizer 
temperature is normalized by the maximum error so that its absolute value is 
between 0 and 1 

en = ^ (4.50) 

^max 



A fraction of heating is given by 



ih = int(e„ n) 



(4.51) 



which is the number of sub-intervals tc with respect to during which it is 
necessary to heat. 

Then, a rule providing the value of h must be chosen 



if e > CjYiax 

else if e < -Cmin 
else if inter < ih 
else if inter > ih 



then Uh = I 
then Uh = 0 
then Uh = I 
then Uh = 0 



(4.52) 



That set of rules can be implemented between a normal PID controller and 
the actuator. In our case, a discrete PI controller was used and the error e{t) 
used in the previous algorithm was replaced by the output of the controller. 
The gain of the proportional controller should be near I. 

Example 4.2: On-Off Temperature Control of a Batch Crystallizer 

Before implementation in the real process, a simulation model of the crystallizer 







# 

T [> 


0 



Figure 4.15: Batch crystallizer 



was designed. The state-space model of the batch crystallizer (Fig. 4.15) used 
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for simulation is 

dT 

dt 

dT^ 

dt 

dt 



u Aint (T^J —T) U A^xt {T — Tamb) 



Vi pi Cp^i 
Peff 

Vm Pm ^p,m 
UhP - Peff 



Vi Pi Cp,i 
u Ajnt {Tw — T) 
Vm Pm P'p,m 



with the parameter values given in Table 4.4. 



(4.53) 



Table 4.4: Main parameters and initial variables of the crystallizer 



Reactor volume 


V = 4.5 X 10-^ m^ 


Density of reactor contents 


p = 1000 kg.m“^ 


Heat capacity of reactor contents 


Cp = 4185 J.kg-I.K-I 


Internal global heat transfer coefficient 


UAint = 14 W.m-2.K-1 


External global heat transfer coefficient 


UAea^t = 0.7 W.m-2.K-1 


Volume of wall metal 


Vm = 0.9 X 10“^ m^ 


Density of wall metal 


pm = 8055 kg.m“^ 


Heat capacity of wall metal 


G^,^= 490 J.kg-i.K-i 


Heat power of the electrical resistance 


P = 1750 W 


Time constant of the electrical resistance 


= 5 s 


Initial temperature in the crystallizer 


T = 290.15 K 


Initial temperature in the crystallizer wall 


= 290.15 K 


Initial effective heat power 


Peff = 0 W 



The parameters were chosen as n = 30, tc = 1, and for the PI controller: 
Kc = 0.5, Tj = 10000. Figure 4.16 shows that the temperature regulation is 
well ensured. 



4.6.6 pH Control 

The pH control in neutralization processes poses difficult problems because of 
the extreme nonlinearity of the pH curve during neutralization (Fig. 4.18). 
Shinskey (1988) devotes an important part of his book to pH control and pro- 
poses several solutions to practically remedy those problems. Lee et al. (2001) 
conduct an important review of the various propositions for pH control. Many 
different approaches for pH control have been proposed in the literature. Even 
in an early paper, McAvoy et al. (1972) used the balances and equilibrium 
equations. When several weak and strong acids or bases are part of a mul- 
ticomponent medium, knowledge of the chemical equilibria can be intricate. 
The concept of invariant reaction was introduced by Gustafsson and Waller 
(1983) and then extended (Gustafsson et al., 1995). Other approaches based 
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Figure 4.16: Heating of the crystallizer. Top figure: set point and tempe- 
rature in the crystallizer. Lower figure: effective power during heating of the 
crystallizer 



on identification have been proposed, and adaptive control is frequently cited 
(Gustafsson and Waller, 1992; Lee et ah, 1993, 1994; Sung et ah, 1998). Non- 
linear control is proposed by Wright and Kravaris (1991). Often, identification 
reactors (Gupta and Goughanowr, 1978) or in-line mixers are proposed (Sung 
et ah, 1995), eventually coupled with feedforward control (Lee and Ghoi, 2000). 
Ghoi et al. (1995) propose a process simulator for pH control studies. 

As a simple example, consider a fed-batch chemical reactor (Fig. 4.17). At 
the initial time, this reactor contains Vb = 10 cm^ of O.IN hydrochloric acid 
HGl. The neutralization is performed by the use of O.IN caustic soda NaOH 
with an inlet fiow rate Fin (0.05 cm^/s). Though these are respectively a strong 
acid and base, their concentration is rather low. The chemical reaction is 

H+ Cr + Na+ OH“ ^ Na+ GP + H2 O 
Assuming an inlet fiow rate Fin (cm^/s), the different balances read 

Fin 

^(|OH-],„-Na+l) (4.54) 

-f^icrl 

to which we must add the equilibrium constant K^q = 10“^^ (at 25 °G) for 



dV 

dt 

d[Na+] 

dt 

d[cr] 

dt 
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water dissociation 

[H+] [OH-] = Keg (4.55) 

and the electro-neutrality equation 



[Na+] + [H+] = [Cr] + [OH“] (4.56) 

The concentration results from this equation, from which the pH is deduced 

pH = -logio([H+]) (4.57) 



The solution of the previous equations gives Fig. 4.18 which is typical of the 
neutralization of a strong acid by a strong base where the desired pH is the 
neutral pH equal to 7. The control question is the following: for any type of 
reactor, fed-batch or continuous, perfectly stirred or plug flow, how is it possible 
to maintain the pH in the neighbourhood of a given value? 



NaOH 



L 






B 


0 

[Cl 



Figure 4.17: Fed-batch neutralization reactor in initial conditions 




- desired pH 
ApH 



Figure 4.18: Typical pH variation during neutralization 




Process Control 



Controller output 
(deviation var.) 



157 




Figure 4.19: Zones of different controller gain according to pH 



Around the neutralization point, for very small amounts of added reactant, 
the pH changes very rapidly, while far from the neutralization point, large 
amounts of reactant provoke a small change. Under these conditions, a clas- 
sical controller with a hxed gain is unable to operate correctly. Moreover, the 
shape of the neutralization curve depends on the chemical components and it 
is recommended to have an idea by titration of the chemical species involved. 
Generally speaking, the controller can be composed into three parts (Fig. 4.19), 
one in a zone well below the desired pH, another one well above the desired 
pH where a high gain can be used, and a third zone around the desired pH 
where a low gain is necessary. If a linear controller with a single gain is cho- 
sen, there exist two possibilities: either the controller gain is very low and 
the corresponding flow of neutralizing agent is also low, resulting in very long 
time responses far from the desired pH, or the controller gain is high and the 
flow of neutralizing agent is large, inducing rapid variations around the pH set 
point, which results in limit cycles and the pH oscillating from low to high 
values. The proposed nonlinear controller with three zones does not present 
that drawback. The ratio between the high and the low gains can be as large 
as 1000; it depends on the type of chemical acids and bases that are present. 
Thus, the controller can only be tuned after titration of the different media and 
its tuning must be modihed if the nature of the influent changes. For those 
reasons, adaptive and feedforward controllers can be proposed for pH control. 
Shinskey (1988) also proposes to use two equal-percentage valves, one for a 
large flow rate, the second for a flow rate slightly over the minimum capability 
of the hrst one. Thus, by operating only one of the two valves at any time, the 
very different conditions of flow can be achieved with negligible interruption 
during the passage from one valve to the other. 

Example 4.3: Control of a Neutralization Reactor 

Consider a continuous neutralization reactor (Fig. 4.20) which has two efilu- 
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ents, one acid represents the stream to be neutralized and is a disturbance, and 
one base, which is the neutralization solution and the manipulated input. As 
a simple example, the acid is assumed to be a strong acid such as HCl and the 
base is a strong base such as NaOH. 

NaOH HCl 




Figure 4.20: Continuous neutralization reactor 

The acid inlet flow rate is denoted by Fa^in (m^/s) and the base inlet flow rate 
is (m^/s). The level control is assumed to be perfect. With respect to the 
previous fed-batch reactor, the different balances are slightly modified 

Fa, in T = Fqui 

^^ = AeMn[OH-],„-F[Na+]) (4.58) 

^^ = ^{Fa,in [Cr]-F[Cl-]) 

The equilibrium equation for water dissociation and the electro- neutrality equa- 
tion are not modified. 

A variable-gain PI controller has been used for controlling this reactor ac- 
cording to the conditions given in Table 4.5. 



Table 4.5: Main parameters and initial variables of the neutralization reactor 



Acid inlet flow rate 


Fa, in = 20 1/s for t in [0,400] s 
then 40 1/s for t in [400,600] s 


Acid inlet concentration 


Ca,in = 0-2 mol/1 for t in [0,200]s 
then 0.4 mol/ for t in [200,600]s 


Base inlet concentration 


Cf,,m = 1 mol/1 


Reactor volume 


H = lOM 


Initial pH 


pH(t = 0) = I 


High proportional gain 


Kc = 500 


Low proportional gain 


Ka = 50 


Integral time constant 


T/ = 100 s 


Maximum base flow rate 


o 

o 

T— 1 

II 
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Figure 4.21: pH control of a neutralization reactor. Top: pH controlled. 
Bottom: inlet flow rate of base 



Initially, the continuous neutralization reactor is far from the steady state. 
From Fig. 4.21, it appears that at the beginning there is a saturation of the ma- 
nipulated base flow rate. Then a stabilization with oscillations occurs around 
the desired neutral pH. The acid concentration disturbance at 200 s and the 
acid flow rate disturbance at 400 s are both correctly rejected. 



4.7 Direct Synthesis Method 

Considering the closed-loop block scheme (Fig. 2.19), one obtains the relation 



^ Gds)Gais)Gds)Km 

l+Gp{s)Ga{s)Gds)G^{s) 



1 + Gp{s) Ga{s) Gc{s) Gm{s) 



D{s) 



(4.59) 

hence the closed-loop transfer function corresponding to the set point variations 



Y 



GpGaGcKm 



Set 



Yr 1 + Gp Ga Gc Gjn 



G — GaGp 



(4.60) 

(4.61) 



r _ GGcKm 

Yr 1 + GGcGjn 



One deduces 



(4.62) 
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hence the “theoretical” controller transfer function 



Gn — 



Y_ 

Yr- 



C TY — n X- 

^ Ytjyiy 



(4.63) 



In fact, the process transfer function Gp, thus globally G, is not perfectly 
known and the ratio Y/Y^ is unknown because the controller has not yet been 
designed. This imposes replacement of the transfer function G by the supposed 
or estimated process model G and the ratio Y /Y^ by a desired ratio (Y/Yr)^. 

In this case, one gets 



Gc 



G Kjj 




(4.64) 



It is remarkable that the process model appears in Gc through its inverse; this 
feature is one of the characteristics of model-based control techniques. This will 
induce a given number of problems concerning poles and zeros: the controller 
poles will be the process zeros and vice versa. It will be necessary to check 
whether this controller is stable and physically realizable. 

The direct synthesis technique is essentially of theoretical interest and should 
be carefully used for processes having unstable poles or zeros. Internal model 
control, which follows in Sect. 4.8, presents some common features and should 
be preferred. 



4.8 Internal Model Control 

The controller design method called internal model control has been in partic- 
ular developed and popularized by Garcia and Morari (1982). At this point, 
we emphasize that it differs from the principle of internal model by Francis and 
Wonham (1976) explained in Sect. 5.9 and it is useful to compare both ideas. 
Internal model control is based on the knowledge of a supposed model of the 
process. The model uncertainty is directly taken into account. It is possible to 
compensate the performance of the control system by its robustness to process 
modifications or modelling errors. This control is recommended in particular 
when the ratio r/G from Cohen and Coon (1953) is small (lower than 5). 

In the block diagram (Fig. 4.23), just as for direct synthesis, the process 
transfer function G represents in fact the process itself and its instrumentation 
(actuator and measurement). The process model G obtained by identification 
and the controller output Ua allow us to calculate Y. In general, the real process 
G and its identified model G differ, moreover disturbances D infiuencing Y are 
unknown, thus Y is different from Y . 

In a classical feedback control scheme (Fig. 4.22), the output is related to 
the set point and to the disturbance by 

GGcYrXD 



I + 



(4.65) 
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In internal model control (Fig. 4.23), denoting the calculated controller by GJ, 
the following relations are obtained 

Y = GUa^D (4.66) 

Y = GUa (4.67) 

Ua = G^E = Gl{Yr -Y ^Y) = Gl {Yr -Y^GUa) (4.68) 

hence 

{l-GGl)Ua=Gl{Yr-Y) (4.69) 

Two characteristics are desired by the user: 

— A perfect set point tracking, thus y = tjr, when no account is made of 
disturbances. 

— A maximum rejection of disturbances, thus a minimum influence of d, when 
the mode is regulation. 

In these conditions, to get a perfect set point tracking which is equivalent to: 
Y (s) = Tr(^) when D = 0, it suffices that 

(1-GG:)=0-^G: = 1 (4.70) 



This condition is realized when the model is perfect, i.e. if G = G. 
From the expression 

Y = GUa + D 

using Eq. (4.69), the general expression of output Y results 



Y = 



GG! 



■Yr + 



1 - GG! 



i + g;(G-G) i + g;(G-G) 



■D 



(4.71) 



(4.72) 



This expression is equivalent to Eq. 
controller is chosen 

Gc = 



(4.63) of direct synthesis if the following 



g; 

l-GG% 



(4.73) 



The disturbance rejection is studied in regulation, so that Yr = 0, and 
disturbances are perfectly rejected when 



(1 - GG*) = 0 




(4.74) 



which constitutes a condition analogous to the previous relation (4.70) when 
the model is perfect. 

The theoretical controller is thus given by the transfer function 

g; = i (4.75) 

This is the classical relation, according to which the controller transfer function 
is equal to the inverse of the process model. Such a controller would be physi- 
cally unrealizable when the degree of the denominator of the process transfer 
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Figure 4.22: Block diagram for classical feedback control 




Figure 4.23: Block diagram for internal model control 



function is strictly higher than the degree of the numerator of this transfer 
function. Moreover, positive zeros or zeros with a positive real part as well as 
time delays present in the process transfer function set a difficulty: positive 
zeros or zeros with a positive real part for the process would be controller poles 
and render it unstable. Process time delays would result in pure advances for 
the controller, thus the latter would be physically unrealizable. 

For this reason, internal model control design is performed in two stages: 
Stage 1: the process model G is decomposed into a product of two factors. The 
first one, (T+ (the gain of which will be taken to be equal to 1) contains time 
delays and positive zeros or zeros with a positive real part 

G = G+G- (4.76) 

Stage 2: only (5_ is retained (to put aside the time delays and positive zeros or 
zeros with a positive real part) and the inverse of G- is filtered (to make the 
controller physically realizable). The real controller transfer function is then 
equal to 

Gc = T / (4.77) 

where / is a lowpass filter with gain equal to 1. 

This filter is typically of the form 

/ = — - — 

(t^s + 1)’’ 



(4.78) 
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where is the desired closed-loop time constant. The exponent r is chosen so 
that Gc is a real transfer function (the denominator degree must be larger or 
equal to that of the numerator) or possibly corresponds to a derivative action 
(in this case, the denominator degree is lower by one unity to that of the 
numerator) . 

With these precautions (decomposition into the two previous factors), the 
controller Gc is physically realizable and stable. It must be noted that as the 
internal model control method is based on zeros cancellation, it must not be 
used for unstable open-loop systems. 

In the ideal case where the model is perfect {G = G)^ the output is equal 
to 

Y = G^fYr^{l-G^f)D (4.79) 

and in a general manner to 



^ y I g- -Gf p 

G- +f(G-G) G_ +f(G-G) 



For real implementation, it is necessary to consider the individual models 
of the actuator Ga and the process Gp, the measurement Gm such that the 
previous transfer function G contains the actuator, the process and the mea- 
surement 

G = Ga Gp Gm (4-81) 

The real implementation will thus be performed according to the block diagram 
in Fig. 4.24. 




Figure 4.24: Block diagram for real implementation of internal model control 

Internal model control allows us to obtain the PID tunings (Rivera et ah, 
1986) for different process models such as in Table 4.6. In the original table, 
zero- (exp(— tf^s) ^ 1) oi first-order Pade approximations are used for time 
delays when necessary (the zero-order approximation requires e > td and the 
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Table 4.6: PID controller parameters based on internal model control from 
Rivera et al. (1986) 



Model 


II 


Controller 


KcK 


Tl 


td 


K 


I 


TS + 1 


T 






TS + 1 


es + 1 


Kes 


e 






K 


1 


(ns + 1 )(t 2 s + 1) 


n + t 2 


n + t 2 


TiT2 


(ns + 1 )(t 2 s + 1) 


es + 1 


Kes 


e 


n + t 2 


K 


1 


T^S^ + 2(ts + 1 


2Cr 


2Cr 


T 


r‘^s‘^ + 2(rs + 1 


es + 1 


Kes 


e 


2C 


-/3s +1 


~ds + 1 


TS + 1 


T 






TS + 1 

K 


es + 1 

1 


K{P + e)s 


yd + e 

1 


T 




s 


es + 1 


l^e 


e 


T 




J^exp(-tds) 




PI 


2t + td 


td 

^ + W 




TS+1 






2e 


2 




r^exp(-ids) 




PID 


2t + td 


td 

^ + w 


Ttd 


TS+1 




+ td 


2t + td 






2 



first-order approximation requires e > td/^). In the PI and PID tuning with 
the nonlinear delay term, e should always be greater than O.lr and greater 
than 1.7td or 0.8td respectively for PI and PID. e is, in general, the closed- 
loop time constant that can often be chosen as the dominant open-loop time 
constant. The original table presents many other models. Comparisons with 
the Ziegler-Nichols and Cohen-Coon tuning methods in the case of the first- 
order transfer function with delay are given by Rivera et al. (1986). In general, 
the performance measure and robustness are better for the IMC design. 

Example 4.4: Internal Model Control of a Thermostatic Bath 

A thermostatic bath (Fig. 4.25) is in charge of maintaining the temperature 
of a water stream circulating at low velocity by means of a coil in which water 
vapour circulates. The bath temperature T is the controlled variable while the 
vapour temperature Ty is the input and the water flow rate F is a modelled 
disturbance. The process can be approximately represented by the following 
energy balance 



FCp{T,n-T)^UA{Ty 



T) = MC,— 



(IT 

dt 



(4.82) 



with the nominal variables and parameters defined in Table 4.7. 

By linearizing the system around its steady state, the Laplace model can 
be written as 



f{s) 



0.0793 
1 + 2.585 



fy{s) 



0.331 
1 + 2.585 



F{s) 



(4.83) 



Suppose that, during process identification, errors occur which lead to the 
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Table 4.7: Nominal variables and parameters of the thermostatic bath 



Water flow rate 


F = 25 kg. min ^ 


Inlet water temperature 


Tin = 294 K 


Water mass to heat 


M = 70 kg 


Global heat transfer coefficient 


UA = 9000 J.min“^.K“^ 


Heat capacity of liquid water 


Gp = 4180 J.kg-hK-i 


Bath temperature 


T = 303 K 


Vapour temperature 


Ty = 407.5 K 




Figure 4.25: Thermostatic bath 



following model transfer function 



G{s) 



0.10 

1 + 35 



(4.84) 



while the disturbance is not measured, 
controller equal to 



Gc=-^f = 
G- 



This model 
1+35 

0.10 (1 + r 5) 



is used to calculate the 
(4.85) 



In the present case, the hlter time constant was chosen to be equal to r = 1 min. 
In simulation, instead of a Laplace transfer function for process (T, the exact 
state-space model was used as well as for the influence of the water flow rate 
disturbance. In this manner, the simulation result is closer to the real process 
behaviour; it takes into account modelling errors and unmodelled disturbances. 
After having reached its steady state, the process is hrst subjected to a set 
point step at temperature T of 5K at t = 50 min, and undergoes flow rate 
disturbance of 10 kg/min at t = 100 min. In Fig. 4.26, it is clear that the set 
point is reached fast and with very little overshoot, and that the disturbance 
is rejected without difficulty with a fast comeback to the set point. We notice 
that the variation of the vapour temperature which is the manipulated input 
is very steep at the set point step time and that probably in reality it would 
be limited by saturations or slower dynamics. 
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Figure 4.26: Internal model control of a thermostatic bath. Top: set point 
and bath temperature response to a set point step of 5K at t = 50 min and 
a flow rate disturbance of 10 kg/min at t = 100 min. Bottom: manipulated 
input, i.e. vapour temperature 

4.9 Pole-Placement 

The process transfer function is represented as 

Gis) = am) 

with degD > degN, and the polynomials N{s) and D{s) are coprime (have 
no common root); it can be useful to consider that G{s) is strictly proper: 
the equivalent state-space representation would be controllable and observable; 
nevertheless this is not compulsory. 

The block diagram (Fig. 4.27) can represent any control system such that its li- 
near controller is the most general possible controller. It will be qualified under 
the general term of pole-placement, which will be justified later, or RST con- 
troller. The controller output u{t) is influenced by both controller inputs yr{t) 
and [y{t) + where r]{t) is the measurement noise. The designation “two- 
parameter” or “two degrees of freedom” is also used to qualify this controller. 

The controller dynamics can be represented by the equation 



(4.87) 
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Figure 4.27: Process control system by pole-placement 



The controller is supposed to be proper; if it is strictly proper, it will be little 
influenced by high-frequency noise thus 

degR{s) < degS{s) and degT(5) < degS{s) (4.88) 

Moreover, d{t) being a disturbance, the output is equal to 

r(5) = N{s) Z(s) + d(s) (4.89) 



by introducing the partial state z(t). Dehning the output error e{t) = yr{t) — 
y{t), the important dynamics with respect to exogenous inputs (set point, dis- 
turbance, noise) are obtained 



i;(5) ■ 
d{s) 

V{s) 

(4.90) 

Closed-loop stability is thus conditioned by the roots of polynomial P{s) of 
Bezout equation or Diophantine equation^ 

P{s) = D{s) S(s) + N(s) R{s) (4.91) 



y(s) 


1 


NT 


DS 


-NR 




U (s) 


1 


DT 


-DR 


-DR 




. ^(4 . 


~ DS + NR 


DS T NR- NT 


-DS 


NR 





which are the closed-loop transfer function poles, thus impose the regulation 
and tracking behaviour. Only the polynomials N and D are known 

D{s) = ^ DiS^ ^ ^ ^ Dn-lS ^ Dn 

N{s) = + Nis^-^ + . . . + Nn-lS + Nn ^ ^ 

with D and N coprime. Because of hypotheses concerning proper transfer 
functions, we obtain 



degP{s) = degD S = n -\- degS (4.93) 

^From the name of Greek mathematician Diophante (325-410), also called the Bezout 
equation. 
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where n is the process order. 

Set 

S(^s) = ^ + . . . + Sfi^ — is -\- Sfi^ 

R{s) = Rqs^^ -\- Ris^^ ^ + . . . + Rji^—is -\- Rfir 



The model of the closed-loop dynamics is imposed by the user by speci- 
fying the closed-loop transfer function, thus the poles (hence the name pole- 
placement) 



r(g) 

r,(s) 



Gm{s) 



Nmjs) 

Dm{s) 



(4.95) 



where Nm{s) and Dm{s) are coprime and Dm{s) fixes the closed-loop poles. 
To get the closed-loop poles, Dm{s) must divide P{s). 

It is possible to distinguish: 

— The placement of poles and zeros where, at the same time, poles and zeros 
of the closed-loop transfer function are specified; it is also called reference 
model control. 

— The pole-placement where only the poles of the closed-loop transfer function 
are placed. 

These two cases will be considered simultaneously, and the consequences of this 
distinction will be studied. 

The process zeros, corresponding to N{s) = 0, are kept in closed loop, 
except if they are specifically cancelled by corresponding poles. Astrom and 
Wittenmark (1989) recommend that the choice of Gm{s) be related to that 
of G; although the zeros of G{s) can be modified by pole-zero cancellation, 
Astrom and Wittenmark (1989) advise keeping the zeros in the model Gm{s). 
Unstable and weakly damped zeros cannot be cancelled and N{s) is factorized 
as 

N{s) =N^{s)N-{s) (4.96) 



where N~^{s) is a monic^ polynomial containing the factors that can be eli- 
minated (corresponding to stable and well-damped zeros) and N~ (s) contains 
the remaining factors. Thus, the polynomial R(s) must be divisible by N~^(s). 
De Larminat (1993) proposes a strategy that realizes a robust pole-placement 
(Fig. 4.28) setting the zeros of R(s) from the zeros of N{s). The recommended 
procedure is developed according to the following stages (if necessary): 

a) Any unstable zero of N{s) is first replaced by its symmetrical zero with 
respect to the imaginary axis (zi z[). 

b) A weakly damped complex zero is replaced by a conveniently damped zero 

(Z2 ^ 4). 

c) A slow zero is replaced by a zero of convenient dynamics (removal from the 
imaginary axis to a distance reciprocal of the settling time) (zs Zg). 



^ A polynomial P{x) is called monic if the coefficient of highest degree monomial is equal 
to 1 

P{x) = x"^ A aix"^~^ + . . . + an-ix + an 
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Figure 4.28: Determination of the zeros of P{s) from the zeros of N{s) 



In general, the polynomial P{s) has a degree higher than Dm{s)N~^ (s), 
which divides it, and one sets 

P{s) = D^{s)N+{s)Do{s) (4.97) 

where Do{s) qualihes the observer dynamics for polynomial P{s): the factor 

Do{s) cancels itself by pole-zero simplihcation and this dynamics is not con- 
trollable by tjr. Do{s) will have to be stable and will have a faster dynamics 
than that of Dm{s). 

The polynomial P{s) is now specihed by Eq. (4.97). As, moreover, N{s) = 
{s)N~ (s), the Bezout equation 

D{s) S{s) + N+ (s) N~ (s) R{s) = (s) N+ (s) Do (s) (4.98) 

imposes that N~^{s) divides S{s) (because N~^{s) is coprime with D{s)), thus 

S{s) = N+{s)Si{s) (4.99) 

SO that the Bezout equation takes the dehnitive form 

D{s)Si{s)^N-{s)R{s)=Dm{s)Do{s) (4.100) 

From the expressions of the closed-loop transfer function, one deduces 

Pjs) Yjs) Pjs) Nmjs) Dmis)N+is)Do{s) Nmjs) Do{s) 

N{s)Yr{s) N{s) D^{s) N+{s)N-{s) D^{s) N~{s) ^ 

(4.101) 
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In order that T{s) is a polynomial and the Bezout equation has a 
is necessary that N~(s) divides A^^(s), thus 


solution, it 


Nm{s)=N-{s)Nm,l{^) 


(4.102) 


One obtains 




T{s) = Do{s)NmA^) 


(4.103) 


The conditions on the polynomial degrees (with degD = n) are 




degDo > 2n — degDm — degA^+ — 1 
degDm - degNm > degD - degA^ 


(4.104) 



thus allowing the physical realizability of the controller. 

The control law is deduced from polynomials (i?, S, T) 

S(s) U(s) = T(s) Yr(s) - R{s) Y{s) (4.105) 

Pole-placement, in the most general formulation, that is to say with par- 
tial zeros simplification (stable and well-damped), is thus constituted by Eqs. 
(4.95), (4.96), (4.97), (4.99), (4.100), (4.102), (4.103), (4.104) and (4.105). 

In the case where all zeros are simplified, it suffices to take N~^{s) = N{s)/a 
and N~{s) = a, a being scalar, with the same equations as previously. 

In the case where there is no zero simplification, it suffices to take N~ = 
N{s) and N^{s) = 1. 

The solution of a general Bezout equation of the form 





AS+BR= 


P 


( 4 . 106 ) 


A(s) 


= uqs” + + . 


. . + Cin-lS + CLn 




B{s) 


= fees'" + feis'”-^ + . 


• • + bm-lS + bm 




P{s) 


= PosP + PisP-^ + . 


. . + Pp-lS + Pp 


( 4 . 107 ) 


R{s) 


= Ros^ + Ris^-^ +. 


. . . T T Rj^ 




S{s) 


= Sos’^ + + . 


. . + Sk-is + Sk 





is performed by identification on successive powers of P{s) (p = /c + r + 2); a 
linear set of equations results 



ao 


0 


0 


0 ... 




0 








ai 


ao 


0 


feo 

fel feo 








■ ^0 ■ 




■ a ■ 


dn 

0 




ao 


bfi 




0 

^0 




Sk 

Rq 


= 








^n— 1 


0 


bfi 


bfi—i 




Rr 




. Pp . 


0 




0 CLfi 


0 ... 


0 


bfi 











/c+i columns 



r+i columns 



(4.108) 
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The matrix to be inverted during the solution is a Sylvester matrix, of dimension 
(2n X 2n), if k = r = n — 1. For a Sylvester matrix to be nonsingular, thus 
invertible, a necessary and sufficient condition is that polynomials A{s) and 
B{s) are coprime. Note that if S{s) was supposed to be monic, this would 
remove an unknown in the problem and would impose a constraint on P{s). 

In this design of a pole-placement controller, there always exists a pole-zero 
simplihcation. The pole-placement controller must be designed as a set that 
receives the reference yr{t) and the output y{t) and returns the input u{t), 
according to the diagram in Fig. 4.27. Chen (1993) proposes an analogous 
implementation from the equivalent state-space scheme. Rather than the theo- 
retical scheme of the pole-placement controller (Fig. 4.27), De Larminat (1993) 
recommends the scheme in Fig. 4.29. It is easy to show that this scheme is 
equivalent by setting 




Figure 4.29: Recommended scheme (De Larminat, 1993) for the implemen- 
tation of a pole-placement controller 



Wolovich (1994) shows that although the choice of the observer polynomial 
Do{s) has no apparent influence on the closed-loop response, as the closed- 
loop transfer function is identical whatever Do{s) is, indeed the choice of Do{s) 
influences robustness by modifying the sensitivity function, which takes into 
account process model uncertainties. 

Example 4.5: Pole-Placement for a Continuous Transfer Function 

Consider the following open-loop unstable process having the transfer function 



G{s) 



(s - 1)(5 + 2) 



(4.110) 
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giving: N{s) = 5 and D{s) = s‘^ s — 2. According to the ITAE criterion 
(Table 4.1), the desired closed-loop transfer function is chosen, which gives the 
step response of Fig. 4.30 



= 



5^ T 1.4cj0'^ T ^0 



(4.111) 



with cjQ = 0.5, corresponding to = 0.25 and = 5^ + 0.7s + 0.25. We 
deduce N~^{s) = 1, N~{s) = 5, A^rn,i = 0-05 and Si = S. 




Figure 4.30: Pole-placement controller: response to a set point unit step 



The polynomial Do{s) which will cancel itself by pole-zero simplification is 
chosen to be equal to Do{s) = 5 + 5, hence 



P{s) =Dm{s)N^{s)Do{s) 

= 5 ^+ 5.752 + 3.755 + 1.25 

with, furthermore, 

P{s) =N-{s)R{s)^D{s)Si{s) 
= 57 ^( 5 ) + (52 + 5 — 2 )/S( 5 ) 



(4.112) 



(4.113) 



The degrees of R and S are deduced: degR = 1 and degS = 1. The Sylvester 
matrix is then of dimension (4 x 4) and the system to be solved corresponding 
to the Bezout equation is written as 



" 1 0 0 0 " 




■ ^0 ■ 




■ 1 


110 0 








5.7 


-2 1 5 0 




R(j 




3.75 


0 -2 0 5 




. -^1 . 




1.25 



(4.114) 



The polynomials R and S are the solutions of the previous Bezout equation 



R{s) = 0.215 + 2.13 ; 6 ^( 5 ) = 6 ^ 1 ( 5 ) = 5 + 4.7 (4.115) 

From Eq. (4.103), the polynomial T is obtained 

T(5) = Do{s)Nm,i (s) = (5 + 5)0.05 = 0.055 + 0.25 (4.116) 
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The closed-loop behaviour will be that of the specihed transfer function Gm{s) 
(Fig. 4.30). Note that the overshoot is very low. 



4.9.1 Robustness of Pole-Placement Control 

Equation (4.90) gives the output error with respect to set point or reference, 
disturbance and noise 

E{s) = I (DS + NR- NT) Yr {s) - DS d{s) + NR fj{s) } (4.117) 

The set point can be expressed as 



V(s) 



N%s) 

£)'=(s) 



(4.118) 



The output error with respect to the set point can be decomposed into the 
natural error e”(t) and the forced error e^{t), related to the type of set point 



_ DS + NR- NT N%s) _ AT®(s) N%s) 
DS + NR D<^{s) ~ P{s) D<^{s) 

~ DS + NR D<^{s) 

= E^{s) + Ef{s) 



(4.119) 



A means to control the steady-state error is to fulhl the conditions so that 
the forced error be zero, thus (s) = 0 as the natural error tends towards 0 
when t ^ oo. 

The problem is now to hnd how we can obtain (s) = 0. It suffices that 



P{s) D^{s) ^ ~Y~ 



(4.120) 



This implies that D^{s) must divide N^{s) as D^{s) is coprime with N^{s). Set 



N%s) 



which would give the error 



E{s) 



■■ AT®'(s) D%s) 
N^{s) AT®'(s) 

W) 



(4.121) 



(4.122) 



In pole-placement, the internal model principle implies that a model of 
D^{s), the denominator in the expression (4.119) of the forced error, is present 
in the denominator D{s)S{s) of the open-loop transfer function, thus 



D{s)S{s) = N^'{s)D^{s) 



(4.123) 
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on the other hand, if 

R{s)-T{s) =R\s)D^{s) 
these two equations imply that the error 

_ DS ^NR- NT N%s) 
“ DS + NR D^{s) 
_ N<^' D'^ + R' D'^ N%s) 
DS + NR £)'=(«) 
N%N<^' + R') 

~ DS + NR 



(4.124) 



(4.125) 



does not depend on ^ thus the forced error is zero. Conditions (4.123) and 
(4.124) thus ensure robustness with respect to eventual process variations mo- 
difying polynomials N{s) and D{s) (Wolovich, 1994). 

Similarly, when the dynamics of undamped disturbances d(t) is known, e.g. 
d{s) = 1/5 for a step, it is advised to take it into account in the controller 
by including the part of the denominator of d{s) that poses a problem in the 
product D{s)S{s) (refer to Eq. (4.90)), in accordance with the internal model 
principle. For the disturbance, this would mean that 5 should be a divider of 
D{s)S{s)^ thus, in general, of S{s). 



4.9.2 Unitary Feedback Controller 

The general pole-placement controller of Fig. 4.27 can be reduced under the 
form of a unitary feedback controller, also called a one degree of freedom con- 
troller (Fig. 4.31) by assuming that T{s) = R{s). The controller transfer 
function becomes simply equal to 

GAS) = HI (4.126) 




Figure 4.31: Process control system with a one degree of freedom controller 

For this controller, the error e{t) = yr{t) —y{t) rules the controller and is made 
zero by the output unitary feedback. 

The fact that the controller has one or two degrees of freedom does not 
modify the number of integrators as, in both cases, the controller order is equal 
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to the order of S{s). Of course, it is simpler to design the one degree of freedom 
controller, as it is sufficient to determine the transfer function R{s)/S{s). The 
one degree of freedom controller makes only a pole-placement. 



4.10 Linear Quadratic Control 

In this section, a voluntarily reduced presentation of linear quadratic control 
is done by the use of continuous transfer functions (Wolovich, 1994). A more 
complete presentation, in multivariable state space, in continuous-time and in 
discrete-time, is outlined in Chap. 14 in relation to optimal control. 

Linear quadratic control allows us to choose a quadratic criterion which 
takes into account not only the error, but also the control signal energy 

pOO 

Jlq= {[y{t) -yr{t)]'^ + pu'^{t)}dt (4.127) 

Jo 

where p is a positive constant. The problem is then to hnd a global transfer 
function which minimizes this criterion with respect to the control u{t). Assume 
that the process transfer function is of the form 

G(») = (4.128) 

with degD > degN, and polynomials N{s) and D{s) being coprime (having 
no common roots). The polynomial 

A{s) = pD{s) D{-s) + N{s) N{-s) (4.129) 

can only take strictly positive values (for p > 0) when s varies in the 5-plane. 
The roots of A ( 5 ) are symmetrical with respect to the imaginary axis. The 
spectral factorization can be performed such as 

A(5) = A+(5)A-(5) (4.130) 

where A+(s) contains all roots of A ( 5 ) located in the complex left half-plane 
and A“(s) in the complex right half-plane. Spectral factorization can be con- 
sidered in the frequency domain as the dual of the Kalman hlter in the time 
domain (Middleton and Goodwin, 1990) 

Denoting the derivative operator associated with the Laplace variable s by 
(5, from the process transfer function, neglecting the disturbances, the input 
can be written as 



u{t) = D{S) z{t) ; y{t) = N{S) z{t) (4.131) 



where z{t) is the partial state (as in Fig. 4.31). 
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4.10.1 Regulation Behaviour 

The criterion (4.127) then becomes 



poo poo 

Jiq = / [y^ {t) + pu^ {t)]dt = / [N^{5) + pD'^{5)]z'^{t)dt 

Jo Jo 

An important property is that the integral 

pOO 

/ [AT2((i) + pD^{5) - A+^{5)]z^{t)dt 

Jo 



(4.132) 



(4.133) 



does not depend on the path followed by z{t). It results that the criterion 
presented in the form 

pOO pOO 

Jig= [N^{5) + pD^{5) - A+^{5)]z^{t)dt+ A+^ {5) z^ {t)dt (4.134) 

Jo Jo 

is minimum if 

A+^((i) z^{t) = 0 A+((i) z{t) = 0 (4.135) 

hence the optimal control law 

u*{t) = [D{S) - A+((5)] z{t) (4.136) 

The closed-loop transfer function is 

r = in = ^ Nis)ar{s) ^ Nis)a 

i;(s) D(s) S(s) + N(s) E(s) A+(s)T'(s) A+(s) ^ 

SO that the closed-loop poles are the roots of A+(s). From Eq. (4.137), a is a 
scalar such that T(s) = aT^(s). 



4.10.2 Tracking Behaviour 

To ensure zero steady-state error, the robustness equations of pole-placement 
are used, considering that quadratic control constitutes an optimal pole-placement. 
In particular, the set point is given by 

N^(s) 

Yris) = ^ (4.138) 

and the following equations of pole-placement are valid 

D{s)S{s) = N^\s) D%s) ; R{s) - T{s) = R\s) D%s) (4.139) 



The monic polynomial D^{s) is defined as the smallest common multiple 
of D{s) and D^{s) so that we have simultaneously 



D^{s) = D{s)D^ (s) ; 
degn^ ( 5 ) < degD^{s) ; 



D^{s) = D'{s) D^s) 
degD'{s) < degD{s) 



(4.140) 
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Supposing that D^{s) and N{s) are coprime, an extended spectral factor- 
ization is realized 



A^s) = pD^{s) D^{-s) + N{s) N{-s) 

= A"+( 5 ) A"- (5) 



(4.141) 



where A^+(5) contains all roots of A^(5) located in the complex left half-plane 
and A^~{s) in the right half-plane. 

By duality, an optimal observer can be dehned from the spectral factoriza- 
tion 

A^s) = a D{s) D{-s) + N{s) N{-s) 

= A^+(5) A^-{s) 



(4.142) 



where a qualihes noise intensity. Let Ti (s) be a monic polynomial formed with 
any n — 1 roots of A^+(s) (nevertheless, the n — 1 fastest or slowest roots can be 
chosen). By choosing the slowest roots (closer to the imaginary axis) which will 
cancel by pole-zero simplihcation, the closed-loop response is faster. It is also 
possible to proceed differently, as is shown in the second part of the example. 

Pole-placement will be specified by the polynomials i?(5), S{s) = (s) S^(s) 

and T(s) =Ti(s)T^(s). 

A first Bezout equation is then obtained, which gives B(s) and S^(s): 



B(s) S(s) + N(s) R(s) = P(s) 

D{s) D^\s) S\s)AN{s)R{s) = A^+(5)Ti(5) 



(4.143) 



then a second Bezout equation is deduced from Eq. (4.139), which gives T'{s) 
and R\s): 

Ti{s) T\s) + D\s) R\s) = R{s) (4.144) 

The robustness condition D{s)S{s) = (s) D^(s) is fulfilled, as 

D{s)S{s) = D{s) D^' (s) S\s) (4.145) 



contains all the roots of D^{s). 

On the other hand, the closed-loop transfer function is equal to 



r = Afl = ^ N{s)n{s)r{s) ^ N{s)T'{s) 

V(s) D{s)S{s) + N{s)R{s) A«+Ti(s) A<^+ 

(4.146) 

an expression analogous to Eq. (4.137). 

Example 4.6: Linear Quadratic Control for a Continuous Transfer 
Function 

The same open-loop unstable system as for pole-placement is considered with 
transfer function G{s) being equal to 



G{s) 



5 

{s - 1)(5 + 2) 



D{s) 



(4.147) 



with N{s) = 5, D{s) = {s — l)(s + 2). 
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The user’s wish is that the control system allows us to realize a tracking 
when the set point is step-like. This corresponds to 






1 

s 



N%s) 



(4.148) 



with N^{s) = 1, D^{s) = s. The monic polynomial D'^{s), the smallest com- 
mon multiple of D{s) and D^{s) is thus equal to 



£»'”(«) =s(s-l)(s-f2) 



(4.149) 



with (s) = s. 

The extended spectral factorization gives 

A®(s) =pD’^{s)D’^{-s)+N{s)N{-s) 

= — ps® -|- 5ps‘^ — 4ps^ + 25 (4.150) 

= A«+(s)A«-(s) 

hence, by choosing p = 1, 

A®+(s) = -f 4.3308s^ -f 6.878s -f 5 (4.151) 

has the following roots: (—1.0293 ± 1.0682i) and —2.2722. 

The spectral factorization for the observer gives 

A^(s) = a D(s) D{-s) + N{s) N{-s) 

= (75^—5(75^+4(7 + 25 (4.152) 

= A^+(5) A^-(5) 

hence, by choosing a = 1 

A^+(s) = 5^ + 3.97125 + 5.3852 (4.153) 

has the following roots: (—1.9856 ± 1.201H). 

Wolovich (1994) recommends to choose for Ti(5) a monic polynomial formed 
with any n — 1 roots of A^+(5). Here, n = 2 and the conjugate complex roots 
do not allow this choice. We have simply chosen Ti = A^+(5). 

It is then possible to solve the first Bezout equation 

D{s)D^'{s) S'{s)^N{s)R{s) =A^^{s)Ti{s) 

5(5 — 1)(5 + 2) /S'(5) + 5 i?(5) = (4.154) 

(5^ + 4.33085^ + 6.8785 + 5) ( 5 ^ + 3.97125 + 5.3852) 

which gives 

R{s) = 9.21625^+21.04315+5.3852 ; 5^'(5) = 5^+7.3025+24.1598 (4.155) 

Two different cases have been studied in the following : 

First case: 
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The second Bezout equation is solved 



Ti{s) T'{s) + D%s) R'{s) = R{s) 

(§2 + 3.9712s + 5.3852) T'(s) + s R'{s) = 9.2162s2 + 21.0431s + 5.3852 

(4.156) 

which gives 

T'(s) = 1 ; R'{s) = 8.2162s + 17.0719 (4.157) 

The polynomials of the equivalent pole-placement are thus 
R{s) = 9.2162s2 21.0431s 5.3852 

S{s) = D'^'is) S'{s) = + 7.302s 2 24.1598s (4.158) 

T(s) = Ti T'(s) = s2 3.9712s 5.3852 



The resulting closed-loop transfer function is equal to 



Gci{s) 



Y{s) _ NT _ NTiT' _ NT' 

i;(s) ~ DS + NR ~ A«+Ti “ 

5 

s3 -f 4.3308s2 -F 6.878s -f 5 



(4.159) 



the very good step response of which is shown in Fig. 4.32 and presents a very 
slight overshoot. 

Second case: 

The polynomial Ti(s), which will be used in the second Bezout equation, is 
replaced by Ti(s) obtained from the n — 1 slowest roots of A®+(s)A°+(s). In 
fact, in the present case, two complex roots are chosen, resulting in 



Ti(s) = (s -f 1.0293 -f 1.0682i)(s -f 1.0293 
The second Bezout equation is solved 



1.0682*) = s^ -F 2.0586s -f 2.2005 

(4.160) 



Ti(s)T'(s) -f T)'=(s)i?'(s) =R{s) 

(s2 -F 2.0586s 2.2005) T'(s) s R'{s) = 9.2162s2 21.0431s 5.3852 

(4.161) 

which gives 

T'(s) = 2.4472 ; R' {s) = 6.7689s -f 16.0052 (4.162) 

The polynomials of the equivalent pole-placement are thus 
R{s) = 9.2162s2 -F 21.0431s -f 5.3852 

S'(s) = T>'='(s) S"(s) = s^ -F 7.302 s 2 24.1598s (4.163) 

T(s) = fi(s) T'(s) = 2.4472 s 2 5.0379s -f 5.3852 

The resulting closed-loop transfer function is equal to 

T(s) NT NTi T 

TAs) “ DS + NR ~ A«+Ti 
12.2360 

s3 -f 6.2434s2 -F 14.409s -f 12.2360 



Gci{s) 



(4.164) 
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the step response of which (faster and without any overshoot compared to the 
hrst case) is shown in Fig. 4.32. It is also possible to compare the obtained 
curves with the curve obtained from the pole-placement with reference model 
(Fig. 4.30). 




Figure 4.32: Polynomial linear quadratic control: response to a set point unit 
step in the two cases (hrst case at top, second case at bottom) 
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Chapter 5 

Frequency Analysis 



In Chap. 4, it was shown that to get a system transfer function, a possibility is 
to subject the system to a sinusoidal input (Fig. 5.1); after a sufficiently long 
time, the output is a sustained sinusoid of same period, but presents a different 
amplitude and a phase angle. 



u(t) = exp (j cut) 


G(s) 


P(t) = G{jui) exp(ja;t) 


U(ju;) 


Y{juj) = G{juj) U{juj) ' 



Figure 5.1: Frequency response analysis of a linear system 

The study of the characteristics of this sinusoidal output constitutes the analysis 
of the frequency response of linear processes. 

The end of the chapter is dedicated to the introduction of the robustness 
concept and its use in the synthesis of a controller. 



5.1 Response of a Linear System to a Sinusoidal 
Input 

5.1.1 Case of a First-Order Process 



To better understand the phenomenon, it is easier to reason about a simple con- 
crete case. Consider a first-order process subjected to a sinusoidal input. The 
output Laplace transform is related to the Laplace transform of the sinusoidal 
input u{t) = Asm{iot) by the transfer function 



y(s) = G(s) U{s) 



Kp A (jO 

Tp S 1 5^ + 



(5.1) 



To know the shape of the time response, Y (s) is decomposed into a sum of 
fractions that have remarkable denominators; e.g. the following decomposition 
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can be chosen 



y(s) 



a 

+ 1 



+ 



bs c 

5^ + 



(5.2) 



By use of the values of the poles and identification, the coefficients a, 6, c are 
obtained 



y(s) = 



KpA 






T UJ S 



+ 1 T5+1 5^ + CJ" 



+ 



UJ 



+ UJ^ 



(5.3) 



resulting in the output y(t) decomposed in natural response and forced response 



yit) 



2 ^ 2 "^ [to; exp(-t)] + [-ra;cos(a;t) +sin(a;t)] 

Vn{t) + yf{t) 

(5.4) 





Figure 5.2: Response of a first-order system {Kp = 2,r^ = 5) to a sinusoidal 
input (A = 2,cj = 1). Top: input u. Bottom: output y 



After a sufficiently long time (Fig. 5.2), the exponential natural response be- 
comes negligible, so that there remains the two sinusoidal terms of the forced 
response, which can be gathered in the form 



y{t) = Aa sm{ujt + (/)) 



(5.5) 



to take into account the amplitude ratio and the phase angle. Identification 
gives 



acos((^) = 






and 



a = 



+ 1 



Kp 

uj‘^ + 1 



and asin((^) = 

uj^ 1 



and (/) = arctg(— rcj) 



(5.6) 

(5.7) 
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It is important to note that Eq. (5.6) defines (j) modulo 2 /ctt while Eq. (5.7) 
defines modulo kn only. As it corresponds to a physical system, the output 
is always delayed with respect to the input and the phase angle is defined as a 
phase lag (see Bode plots). 

The response has the same period as the input; the amplitude ratio (coef- 
ficient a) and the phase angle depend on pulsation uo (related to the frequency 
by cj = 2 7TZ/). 

Looking at the process transfer function 



G{s) 



Kp 

r 5 + 1 



(5.8) 



one realizes that: 

• The amplitude ratio is equal to the modulus of G{j u). 

• The phase angle is equal to the argument of G{j u). 

This result is, in fact, valid not only in the case of a first-order system, but in 
the case of any linear system. 

5.1.2 Note on Complex Numbers 




Figure 5.3: Geometric representation of a complex number with abscissa, 
ordinate, modulus and argument 



A complex number z is defined by 

z = X jy with = — 1 



(5.9) 



where x is the real part and y the imaginary part. The complex number z 
corresponds to the representation (Eig. 5.3) in the complex plane (Ox, Oy) 
of vector OM (coordinates x, y) where the real part is the abscissa and the 
imaginary part the ordinate, z can also be expressed in the form 



z = pexp{j(j)) with: p = \z\ = y/x^ + ; (j) = (Ox, OM) 

corresponding to: x = pcos((^) ; y = psin((^) 

Often, z is given by its modulus and its argument (j) as 

|z| = \/ x‘^ -\- y‘^ ; <p = arctang( — ) 

X 



(5.10) 



(5.11) 
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p is called the modulus of z aud argumeut (or augle) of z. Note that Eq. (5.10) 
defiues modulo 2 /ctt while Eq. (5.11) defiues modulo kn ouly. Therefore, 
some iuformatiou is lost iu the passage from Eq. (5.10) to Eq. (5.11). 

Let aud Z 2 be two complex uumbers. The followiug properties couceruiug 
the modules aud argumeuts of products aud quotieuts of complex uumbers are 
importaut 



\zi Z2I = \zi\ |Z2| 
Zl _ \zi\ 

Z2 \Z2\ 



aud 

aud 



arg (zi Z 2 ) = 01 + ' 



Zl 

arg 1 — ) = 



(5.12) 



i.e. the modulus of the product of two complex uumbers is equal to the product 
of their moduli, aud the argumeut of the product is equal to the sum of their 
argumeuts. The modulus of the quotieut is equal to the ratio of their moduli 
aud the argumeut of the quotieut is equal to the differeuce of their argumeuts. 



5.1.3 Case of any Linear Process 

Iu a geueral mauuer, iu opeu loop, the process output is related to the ma- 
uipulated iuput by the process trausfer fuuctiou. Iu closed loop, the output 
is related to the set poiut or to the disturbauce by a more complicated global 
trausfer fuuctiou. Iu both cases, this trausfer fuuctiou cau be described as the 
ratio of two polyuomials such that the uumerator degree is always lower or 
equal to that of the deuomiuator. With both cases beiug ideutically treated, 
ouly the first case is explaiued 

Y{s)=G{s)U{s) = ^^U{s) (5.13) 

The process is subjected to a siuusoidal iuput u{t) = Asiu(cjt), heuce 



y(s) 



G{s) 



AiO 

5^ + 



P{s) Aio 
Q{s) 5 ^ + 



(5.14) 



The ratio of both polyuomials cau be decomposed iuto a sum of ratioual frac- 
tious by usiug the zeros of Q{s) (poles Si of G{s) ), which leads to 



y(s) 



Cl C2 

S — Si 5 — 52 



+ . . . + 



+ 



S j iO s — j u 



(5.15) 



As we are coucerued with the behaviour of stable systems, this meaus that all 
poles Si of G{s) are uegative real or complex with a uegative real part. The 
output y(t) is composed of the uatural respouse aud the forced respouse. Terms 
comiug from the ratio P/Q correspoud to the stable uatural respouse composed 
of expoueutial fuuctious with real or complex coefficieuts, but iu all cases have 
a damped behaviour wheu t becomes very large. There theu remaius the two 
last terms of the forced respouse, giviug the respouse after a sufficieutly loug 
time 



y(t) = a exp(— j ut) exp(j u t) 



(5.16) 
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To calculate a, it suffices to use the expression of Y (s), multiply by (5 + j cu) 
and set s = —jcu, similarly for 6, hence 

a = = ^j G{-j uj) and b = ^G{j w) = G{j w) (5.17) 

The response y{t) is real and it can be checked that a and b are conjugate 
complex. The response after a sufficiently long time can be written as 



yit) = — j [G{-juj) exp{-jujt) -G{juj) exp(ja;t)] 
and can be simplihed by writing that 

G{ju) = \G{ju)\ exp{j(p) 

giving the output after a sufficiently long time 



y{t) =A\G{ju)\ sin{ujt + (f)) 

The amplitude ratio is equal to 

AR=\G{juj)\ 



and the phase angle 



argument {G{j co)) 



(5.18) 

(5.19) 

(5.20) 

(5.21) 

(5.22) 



5.1.4 Case of Linear Systems in Series 

Consider a global system constituted by n linear subsystems in series (Fig. 
5.4). The transfer function of the global system is the product of the individual 
transfer functions 

G{s) = (^ 1 ( 5 ) (^ 2 ( 5 ) . . . Gn{s) (5.23) 




Figure 5.4: System composed of linear subsystems 

The modulus \G{j io)\ of the global system transfer function is equal to the 
product of the moduli of the individual transfer functions 

^)\ = l^i(i ^)l l^2(j ^)| • • • \Gn{j ^)\ (5.24) 

while the argument of the global system transfer function is equal to the sum 
of the arguments of the individual transfer functions 



arg (G{juj)) = arg {Gi{j uj)) + arg {G2{j uj)) + . . . + arg {Gn{j (^)) (5.25) 
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5.2 Graphical Representation 

The influence of frequency on the modulus of G{j u) and on its argument can 
be represented by different plots, in particular Bode and Nyquist ones. 

5.2.1 Bode Plot 

The Bode plot consists of two parts (see example in Fig. 5.5): in the first plot, 
the logarithm of the modulus of G{j u) is represented versus the frequency 
logarithm; in the second plot placed just below, the argument of G{j u) is 
represented versus the frequency logarithm. 

First-Order System 





Figure 5.5: Bode plot for a first-order system {Kp = 2, r = 5) 



From the transfer function, one infers 



G{s) 



Kp 

r 5 + 1 






Kp 

T j UO -\- 1 



giving the modulus and argument of G{jio), respectively 



(5.26) 



\G{ju;)\ 



Kp 

+ 1 



(5.27) 



<p = arctg(— rcj) (5.28) 

When u is small (low frequencies), the modulus of G{j u) is equivalent to the 
constant process gain Kp (low-frequency asymptote) and the argument (j) (phase 
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lag) tends towards 0 (Fig. 5.5). When co is large (high frequencies), the modulus 
of G{juj) is equivalent to 1/cj, thus in the log— log plot [log(|G(j cj)|), log(cj)] 
this gives a straight line of slope —1 (high-frequency asymptote), or, in decibels, 
-20dB/decade, and the argument (phase lag) tends towards — 7t/2. 

The modulus expressed in decibels is simply proportional to the logarithm 
of the modulus expressed without units 

|G(ia>)ldB=201og(|G(ja>)|) (5.29) 



Second-Order System 

A second-order transfer function is equal to 



G(s) — 






G{juj) = 



Kr, 



+ 2 (tS + 1 1— T^Uj'^+j2(TUJ 

giving the modulus and argument of G{ju>), respectively 



\GUu;)\ = 



Kr, 



Y^(l— 



= — arctg( 



1 — 



) 



(5.30) 



(5.31) 

(5.32) 




Figure 5.6: Bode plot for a second-order system for different values of ( 
(iC^ = 2,r = 5,C = 0.2;0.5;l;2;) 

When io is small (low frequencies), the modulus of G{j u) is equivalent to the 
process constant gain Kp and the argument (j) (phase lag) tends towards 0 (Fig. 
5.6). When u is large (high frequencies), the modulus of G{j u) is equivalent 
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to thus in the log— log plot, it corresponds to a straight line of slope —2 

and the argument (phase lag) tends towards — tt. 

When a plot is drawn for different values of (, the gain curve presents a re- 
sonance (amplitude ratio higher than its value on the low-frequency asymptote) 
for ( = 0.2 and ( = 0.5 at the corner (or break) frequency cj = y/l — jr^ 
which disappears for C, > \/2/2. To check it, r being fixed, it is sufficient to 
search the maximum of \G{ju)\ with respect to u. This maximum is equal to 

KV(2Cv/1^^)- 

5.2.2 n-th-Order System 

Consider an n-th-order system with transfer function G{s). At low frequency, 
the modulus of G{j u) tends towards the process constant gain Kp and the 
argument (j) tends towards 0. At high frequency {u large), the modulus of 
G{j u) is equivalent to 1 /cj^, thus in the log— log plot, it corresponds to a 
straight line of slope — n and the argument tends towards — n7r/2. 



Inverse Response System 

Inverse response systems are considered as nonminimum-phase systems. To 
understand this problem, the influences of the numerator and denominator of 
a transfer function present under the form 



G{s) 



(2 5 T 1 
bs 1 




are decomposed, where a is positive. 

A term as {b s 1) in the transfer function denominator is called a process 
“lag” (or delay), as the output is delayed with respect to the input: the phase 
angle (j) is negative. On the other hand, a term as {a s 1) in the transfer 
function numerator is called process “lead” (or advance), as the output is in 
advance with respect to the input: the phase angle (j) is positive. Thus, transfer 
functions as (5.33) are called “lag— lead”. 

Suppose that the transfer function is simply 



(2 5 T 1 



It follows that 

\G{jto)\ = \/a^uP‘ + 1 (5.35) 

(\) = SiTg{G{jio)) = arctg(acj) (5.36) 

The amplitude ratio has a positive slope equal to +I at high frequencies, thus 
(j) is included between 0 and +7 t/ 2. This implies that the amplitude ratio tends 
towards infinity when frequency becomes infinitely large, which has no physical 
sense. This corresponds to a negative zero for the process. 

If the system transfer function is 



G{s) 



1 — as 
bs 1 
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with a > 0, the system now presents a positive zero. Then it is an inverse 
response system. Notice that the amplitude ratio is the same as previously, 
but that the phase angle corresponding to the numerator is the opposite: = 

— arctg(acj). A positive zero (in the right half-plane) contributes to a phase 
lag of the global frequency response. 

Processes having a zero in the right half-plane or a pure delay are called 
nonminimum-phase because they present more phase lag than any transfer 
function that have the same amplitude ratio. 

System with Time Delay 




Figure 5.7: Bode plot for a hrst-order system with time delay {Kp = 2,r^ = 
5,t^ = 20,iCe = 2) 

The delay 6 is characterized by the presence of a nonlinear term of the 
form exp {—Os) in the transfer function, which can be written in the form of 
a product of the exponential term and a transfer function without time delay 
denoted by G^d 



G{s) = exp {-Os) G^ijd{s) 
The transfer function for s = jco is then equal to 


(5.37) 


G{juj) = exp {-jujO) Gyjd{j^) 


(5.38) 


thus its modulus is equal to 




\G{ji^)\ = \Gyjd{j^)\ 


(5.39) 


and its argument 

arg((T(jcj)) = ^rg{Gy,d{j^)) ~ ^0 


(5.40) 
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thus the phase angle is not bounded (Fig. 5.7). This system is nonminimum- 
phase. 



5.2.3 Nyquist Plot 

The Nyquist plot (Fig. 5.14) consists of representing in the complex plane, in 
abscissa, the real part of G{j u) and, in ordinate, the imaginary part of G{j u). 
Thus, the locus is that of a point M such that the vector OM corresponds to 
the complex number G{jio). It will verify OM = \G{j io)\ and (Ox^OM) = 
SiTg{G{j io)). The Nyquist plot gives the same information as the Bode plot, 
but in a different form. 



Im(G(ja;)) 

]gG)\ 




Re{G{jLo)) 



Figure 5.8: Principle of Nyquist representation 



First-Order System 

The transfer function of a first-order system 



t 

II 


(5.41) 




(5.42) 




(5.43) 



Notice that the real part is always positive and the imaginary part always 
negative, thus the curve is entirely situated in the lower-right quadrant (Fig. 
5.9); the angle (Ox^OM) = arg((T(jcj)) is included between 0 and — 7t/2. 
In this figure, the locus is represented as a full line for positive frequencies 
and as a dotted line for negative frequencies. When the pulsation co is zero, 
the representative point is at Kp on the x axis; then it moves towards the 
coordinates’ origin (limit for co tending towards infinity). The tangent at the 
coordinates’ origin is vertical. The locus for positive frequencies is, in fact, a 
half-circle centred on the abscissa axis. 
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The imaginary part is always negative, thus the curve is located in the lower 
half-plane (Fig. 5.10) ; the angle (Ox^OM) = arg((T(jcj)) is included between 
0 and — 7T. When the pulsation uu is zero, the point is in Kp on the x axis; then 
it moves towards the coordinates’ origin (limit for co tending towards infinity). 
The tangent at the coordinates’ origin is horizontal. 

5.2.4 n-th-Order System 

Consider a general n-th-order system with transfer function G{s). At low fre- 
quency, (cj tends towards 0), the representative point M in the Nyquist plane 
is on the abscissa axis with its abscissa equal to the steady-state gain of the 
transfer function. When co increases, the point moves with a decreasing mo- 
dulus p = OM and a negative argument. At high frequency, the point tends 
towards the origin of the coordinates. Finally, the angle (Ox, OM) describes 
an angle of —n7rl2 when co increases from 0 to oo. 



System with Time Delay 




Figure 5.11: Nyquist plot for a first-order system with time delay {Kp = 
2, Tp = 5,td = 20, Kc = 2) 

Taking again previous notations for the transfer function with time delay 
G(s) = exp {-6s) Gy,d{s) G{juj) = exp (-jivd) G^dij^) (5-47) 

the real and imaginary parts are deduced 

Re[G(ja;)] = Re[G^„d{j cos{u6) +lm[G^„d{j sin(a;6') (5.48) 

Im[G(ja;)] = Im[Gtt,d(j w)] cos(a;6') - Re[G^„d{j sin(a;6') (5.49) 
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Due to the presence of periodical cosinus and sinus factors in the expressions 
of the real and imaginary parts of the transfer function with time delay, and 
also due to the decrease towards zero of the modulus of Gwd(j^) when the 
frequency increases, the curve in the Nyquist plot for a hrst-order transfer 
function with time delay has a spiral aspect. When the frequency tends towards 
0, the representative point of maximum modulus of G is far on the right on 
the abscissa axis, and as soon as frequency increases, the representative point 
converges towards the origin of the coordinates (Fig. 5.11). 



5.2.5 Black Plot 

The Black plot of any transfer function is obtained by choosing as abscissa the 
argument of transfer function aig{G{juj)) according to a linear scale and as 
ordinate the modulus \G{ju)\ according to a logarithmic scale. 



5.3 Characterization of a System by Frequency 
Analysis 

Previously, it was shown that it is possible to characterize linear systems by the 
analysis of the frequency response to a sinusoidal input, knowing the process 
transfer function. 

In fact, the inverse problem is often set as: given the values of a real process 
frequency response, what transfer function represents this process? This is an 
identihcation question. 

The curves or the values of a real process frequency response can be obtained 
in different ways: 

• A direct sinusoidal input: a series of runs with sinusoids of different fre- 
quencies; the amplitude ratio and the phase angle are directly determined 
from input and output signals. 

• Impulses which theoretically provide the same information but need a math- 
ematical treatment. 

• Random or pseudo-random inputs (pseudo-random binary or ternary se- 
quences) used in advanced identification techniques. 

Transfer functions can be deduced from the frequency response in two ways: 

• By a simple reasoning on the slopes of the amplitude ratio, the limits of the 
phase angle (bounded or not). 

• By mathematical treatment, e.g. by using a least squares method after 
transformation of the transfer function. 

In the following, it is assumed that the process transfer function is well known. 
The question is then to design the controller. The first condition that the 
controller must fulfil is stability. The aim will be to get suitable characteristics 
of the time response. 
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5.4 Frequency Response of Feedback Controllers 

5.4.1 Proportional Controller 

From the controller transfer function 

^c(5) = K, (5.50) 

it follows that the modulus of Gc{jio) is equal to the controller gain Kc and 
that the phase angle is zero for any frequency. 

5.4.2 Proportional-Integral Controller 




Figure 5.12: Bode plot for a PI controller {Kc = 2 , r/ = 5) 

The PI controller transfer function is equal to 

Gc{s) = Kc ^I + (5.51) 

Replacing s by jcj, the amplitude ratio results 

AR = \Gc{jio)\ = KcJ I H — 2 — 2 (5.52) 

V 



^ = arg((Tc(jcj)) 



arctg( ) 

TjUJ 



(5.53) 



and the phase angle 
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At low frequencies (Fig. 5.12), the integral action is dominant, the slope of the 
straight line of the amplitude ratio is equal to —1 and the phase angle is — 7t/2. 
When the frequency is high, the PI controller behaves as a simple proportional 
controller, the amplitude ratio tends towards Kc (slope = 0) and the phase 
angle towards 0. 

5.4.3 Ideal Proportional-Derivative Controller 




Figure 5.13: Bode plot for an ideal PD controller {Kc = 2 ^ td = ^) 

Although this type of controller is not physically realizable, it is interesting to 
look at its Bode plot in order to extract the characteristics of the derivative 
action. The controller transfer function is equal to 

Gc{s) = Kc {l-\-T]js) (5.54) 

Replacing s by jcj, the amplitude ratio results 

AR = |Ge(jw)| = (5.55) 

and the phase angle 

^ = d.ig{Gc{juj)) = arctg(rDCj) (5.56) 

is positive for any value of cj, thus it constitutes an advance. 

At low frequencies (Fig. 5.13), the PD controller behaves as a simple pro- 
portional controller, the amplitude ratio tends towards Kc (slope = 0) and the 
phase angle towards 0. 
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Figure 5.14: Influence of noise on a system controlled by an ideal PD controller 



At high frequencies, the derivative action dominates, the slope of the straight 
line of the amplitude ratio is equal to +1 and the phase angle is +7t/2. 
Assume that a sinusoidal measurement noise of the form rj = a sin ujt influences 
a system controlled by an ideal PD controller (Fig. 5.14). Considering only 
the pure derivative action of the PD controller, the presence of the noise adds 

djTj 

to the input u a signal Uj^ equal to Uj^ = —KcTd— = —aKcTDOJ cos(cjt), so 

that if the noise contains high frequencies, the control signal will have a very 
large amplitude. To solve this problem, the ideal derivative action is modified 
by filtering with a first-order filter of time constant td/N so that the real 
derivative transfer function will be in the form 



Kc 



Tp S 

Tp s/N + 1 



(5.57) 



or in a slightly different form 

Kc A ^ 5 !3 = l/N 

P T£) s + 1 



(5.58) 



The derivative action is thus only sensitive at low and mean frequencies. The 
gain is limited by N at high frequencies and consequently also the measure- 
ment noise at high frequencies. Real PID controllers will cut the gain at high 
frequencies. 

The PD controller is seldom used alone, but an integral action is always 
added. A PD controller is only usable alone when the process is a pure inte- 
grating system. 



5.4.4 Proportional-Integral-Derivative Controller 

Ideal PID Controller 

The transfer function of the ideal PID controller is equal to 

= Kc ^1 + (5.59) 

Because of the derivative term, this controller is not physically realizable. By 
replacing s by jcj, the amplitude ratio results 




Process Control 



199 




Figure 5.15: Bode plot for an ideal PID controller (Kc = 2 , r/ = 5 , ri:) = 4) 



and the phase angle 



(j) = arg(G(ja;)) = arctg(TDa; ) 

TjUJ 



(5.61) 



At low frequencies (Fig. 5.15), the PID controller behaves as a PI controller; 
the integral action prevails, the amplitude ratio has a slope equal to —1 and 
the phase angle tends towards — 7 t/2. 

At high frequencies, the derivative action prevails, the slope of the straight 
line of the amplitude ratio is equal to +1 and the phase angle tends towards 
+7 t/ 2. Thus, the phase angle change is tt. 



Real PID Controller 



With respect to the ideal PID controller, in a real PID controller (Figs. 2.26 
and 2.108), the derivative action is modihed by hltering by a hrst-order hlter 
in order to limit the influence of the high-frequency noise. The controller will 
then be physically realizable. 

The transfer function of the real PID controller (pneumatic or electronic) 
is equal to 



^c(^) = K, 



I + T/ 5 

T/ S 



TpS^l 

(3tds^I 



(5.62) 



The values of parameter (3 are often included between 0 and I. Its transfer 
function corresponds, in fact, to the product of the transfer function of a PI 
and that of a PD. 
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Figure 5.16: Bode plot for a real PID controller (Kc = 2 , r/ = 5 ^ td = 
4 , /3 = 0.1 (curves a), and (3 = 0.5 (curves b)) 



After rearrangement, this equation can be written in the form 
= (3td\ + 1 

The first fraction acts as a first-order filter on a controller that would have 
its parameters K* , Tj , equal to 

K: = kJ^ + i) ; Tf=TD+Ti ; = (5.64) 

\Tl J Td ^Ti 

With respect to the ideal PID, the high-frequency behaviour is modified, as the 
asymptote of the amplitude ratio is then bounded 

lim AR = ^ (5.65) 

(Jj^oo p 

For this reason, the controller is less sensitive to the noise. Like in the PD 
controller, the amplitude ratio is bounded at high frequencies. The derivative 
action is then limited in the high-frequency domain, which is an advantage as 
that limits the high-frequency noise influence. Figure 5.16 for (3 = 0.1 shows 
that the amplitude ratio is bounded when co becomes larger than about 10 
rad/s. If larger values of (3 are chosen, the increase of the amplitude ratio 
is attenuated and the phase bump disappears {(3 = 0.5). For (3 = 0.9, the 
behaviour is very close to a PI controller. 
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5.5 Bode Stability Criterion 

The stability of the closed-loop system is characterized by the solutions of the 
characteristic Eq. (3.41), which can amount to study of the open-loop transfer 
function Goi{s). The Bode plot of Goi{s) is a means of realizing this stability 
study in frequency analysis. Tuning of the PID controller will result. 

An important point is that the Bode criterion can only be applied to 
minimum-phase transfer functions. A system is considered to be minimum- 
phase if all its poles and zeros are in the left half complex plane. In the opposite 
case, if at least one pole or zero is in the right half complex plane, the system 
is said to be nonminimum-phase (Shinners, 1992). In that case, the stability 
study will be made e.g. by means of the root locus. 

This assertion can sometimes be tempered and the Bode criterion can be 
applied to some nonminimum-phase systems when the reason of that behaviour 
is known (e.g. the presence of a pure delay). 

Consider the hrst-order process with time delay of Fig. 5.17 for which the 
following parameters are chosen Kp = 2, r = 5, = 20. It is assumed that 

both transfer functions of the actuator and of the measurement device are equal 
to 1: (Ta = 1, Gm = 1- As an aside, the controller is set in proportional mode 
and has gain Kc = 2. The process transfer function is equal to 

Gp{s) — Kp (5.66) 

r 5 + 1 

giving the following open-loop transfer function 

Goi(s) = Gc(s)Ga(s)Gp(s)G^(s) = (5.67) 

Note that Goi (s) is the transfer function of the loop when the latter is considered 

open. This transfer function is the same as that which plays a key role in the 
denominator of the closed-loop transfer function and thus of the characteristic 
equation. The Bode plot of this transfer function shows that because of the 
exponential term, the phase angle is not bounded and that for a value of the 
frequency called the phase crossover frequency, it is equal to — tt (Fig. 5.18) 

arg {Goi{juj4>)) = -tt (5.68) 

According to Fig. 5.18 when the frequency is equal to the phase crossover fre- 
quency U(f) ~ 0.13 rad/s, the amplitude ratio from the Bode plot (gain curve) is 
equal to around \Goi{joO(f))\ = {AR)(f) 3.36. From the expression of the trans- 

fer function, the amplitude ratio is proportional to the gain of the proportional 
controller, which is taken equal to 2. It results that 

\Goi\ ocKc^ \Goi{jlo^)\ ^ ^ 1.68 (5.69) 

J\c 2 



In the absence of a Bode plot for the phase angle or of a numerical solver, 
the phase crossover frequency can be determined by the study of the function 
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Figure 5.17: First-order system with delay in closed loop 



SiTg{Goi{jio)) versus cj, the latter varying in general in the range [10“^, 10+^] 
rad/time unit for typical chemical processes. If the phase crossover frequency 
is not found in this range, the latter can be somewhat enlarged. The previous 
example would thus give Table 5.1. 



Table 5.1: Study of the function arg versus co 



Frequency (rad/s) 


10“^ 


10“' 


0.1285 


1 


10' 


10^ 


Phase angle (°) 


-14.32 


-141.15 


-180 


-1224 


-11548 


-114681 




Figure 5.18: Bode plot of the transfer function of the open loop for a first- 
order system with time delay {Kc = 2 , Kp = 2 , = 5 , = 20) 

The gain of the proportional controller is susceptible to taking any positive 
value. If the gain Kc is chosen to be equal to the ultimate gain {Kc)u defined 
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as 



thus 



{Kc)u ^ 



Kc 

\Goi {j^(f))\ 



2 

3M 



0.595 



(5.70) 

(5.71) 



the amplitude ratio becomes equal to 1 and the measured output ym is sinu- 
soidal 



ym = ^ sin(0.13 1 — 7t) = — sin(0.13t) (5.72) 



In fact, the Bode plot is built for the open-loop system response to a sinusoidal 
set point. By choosing Kc = {Kc)u^ the measured output is then exactly in 
phase opposition with the sinusoidal set point and has the same amplitude. 
The gain Kc of the proportional controller, such that the modulus of the open- 
loop transfer function GqI (or amplitude ratio) is equal to 1, is called the ulti- 
mate gain {Kc)u (Fig- 5.23). Any value of the gain larger than the ultimate 
gain makes the system unstable. 

Consider again the characteristic equation 



l^Goi=0 (5.73) 

It can be noticed that the hrst member becomes effectively zero for the phase 
crossover frequency and the controller gain, such that the amplitude ratio is 
equal to I. Thus, one solution of this equation has been found (this equation 
may possess several of an infinity of solutions for some systems). 

Now, consider the following: 

• As a response to a sinusoidal set point, the system has been brought into 
a state of sustained oscillation for a value of the phase crossover frequency; 
moreover, the system amplitude ratio is equal to I for a particular gain of 
the controller. 

• This value of the phase crossover frequency and this gain are kept. 

• The loop, which was previously closed, is opened. 

• Instantaneously, the set point is set to zero yr = 0. In this case, nothing is 
changed, as the comparator changes the sign of the measured output, which 
is then equal to the previous set point. Thus, theoretically, the response 
must oscillate as a perfect sinusoid. 

This point corresponding to the phase crossover frequency uj(^ and to the ulti- 
mate gain (limit gain) of the controller such that the amplitude ratio is equal 
to I is thus a limit point for the system 

• IfiFe cu — 0.595, then the amplitude ratio is larger than I for cj) — tt 
and the oscillation of the system will have an increasing amplitude; the 
system is thus unstable. 

• If Kc < Kcu = 0.595, then the amplitude ratio is lower than I for (^ = — tt 
and the oscillation of the system will have an decreasing amplitude; the 
system is thus stable. 

Figure 5.19 thus shows this infiuence of the proportional gain in the neighbour- 
hood of the ultimate gain in the response to a step. 
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Figure 5.19: Response to a unit-step of the first-order delayed system {Kp = 
2 ^ Tp = 5 ^ td = 20), for three close values of the proportional gain (left = 
0.58, centre = 0.595, right = 0.61). Three regimes are displayed from 
left to right: stability, sustained oscillation, instability 



The Bode stability criterion results: a feedback control system is unsta- 

ble if the amplitude ratio of the corresponding open-loop transfer 
function is larger than 1 at the phase crossover frequency. 

The Bode stability criterion cannot be applied to systems having an ampli- 
tude ratio or a phase angle which does not vary monotonously with respect to 
frequency. 

It may happen that a closed-loop system is unstable for a low value of the 
gain Kc and that it becomes stable for a larger value of the gain Kc. This is 
the case of unstable open-loop systems. Consider, for example, the following 
system 

^ 

With a positive pole, this system is open-loop unstable. Its phase angle is 
between —180° and —90° (Fig. 5.20). This system becomes closed-loop stable 
since Kc > 2. 

It may also happen that the argument of \Goi{joo) takes the value — tt for 
more than one frequency u. In this case, it is necessary to use the Nyquist 
criterion (Sect. 5.7). 

To understand the limitations of Bode criterion, it is recommended to consider 
the Nyquist plot and the Nyquist criterion. As the Bode plot is easier to 
obtain manually, it was often preferred with respect to the Nyquist plot. With 
a computer and a specialized package, this problem disappears. 

For the sustained oscillation, which was obtained at the phase crossover 
frequency for a given value of the controller gain, we must recall the Ziegler- 
Nichols tuning method of the controller, which was mentioned in Sect. 4.5.2. 

For a second-order system regulated by a real PID, the Bode plot of the 
open-loop transfer function (Fig. 5.21) shows that the phase angle remains 
between 0 and 180°, thus the system is stable, which can be checked with the 
closed-loop response to a set point variation. 

Note that an open-loop second-order system without delay has a phase angle 
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Figure 5.20: Bode plot of the open-loop transfer function of the unstable 
process (5.74) 





Figure 5.21: Bode plot of the open-loop transfer function for a second-order 
process {Kp = 2,r = 5,(“ = 0.5) plus a real PID controller (Kc = 10 , r/ = 
5, td =4, /3 = 0.1) 



always larger than — tt and thus does not present any phase crossover frequency. 
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Let the open-loop third-order system 



Goi 



Kc 



(5.75) 



with Kc = 10. From Fig. 5.22, it appears that the phase angle varies between 
0 and — 37 t/ 2 and thus this system presents a phase crossover frequency ~ 2 
rad/s. 




Figure 5.22: Bode plot of the open-loop third-order process transfer function 
1/(25^ + 55^ +75 + 1) plus a proportional controller (Kc = 10) 



Conclusion: to present a phase crossover frequency, the open-loop transfer 
function GqI must be at least of third order or present a delay. In the latter 
case, it can have any order. 



5.6 Gain and Phase Margins 

The model of a system is always determined with uncertainty. Moreover, the 
system is susceptible to shift with time. Also, disturbances are present. For 
these reasons, it is easy to understand that it is unsafe to operate a controller 
too close to the limit value previously found; a safety margin will have to be 
provided. To calculate the gain margin and the phase margin, the controller 
is first set in proportional mode as in the application of the Bode criterion to 
determine the ultimate gain. This is also related to the tuning by trial and 
error (Sect. 4.5.1). 
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5.6.1 Gain Margin 



Let AR(f) be the amplitude ratio at the phase crossover frequency uj(^ (Fig. 
5.23); the gain margin is dehned as 



GM = 



1 



1 

\Goi 



(5.76) 



\og{\AR\) 




Figure 5.23: Gain and phase margins on Bode plot 

If the open-loop transfer function is calculated with the gain of the pro- 
portional controller equal to 1, the gain margin is equal to the ultimate gain 
{Kc)u- Otherwise, it is equal to 

GM = 



K, 



(5.77) 
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where Kc is the gain of the proportional controller. 

Frequently, the gain margin is expressed in decibels 

= 20 logio(i^c)u - 20 logio(i^c) (5.78) 

As the amplitude ratio must be smaller than 1 at the phase crossover fre- 
quency, it results that the gain margin must be larger than 1. The larger this 
value is compared to 1, the further the system is from instability: it becomes 
robust at the expense of performance. 

A recommended value of the gain margin is between 1.7 and 2. 

5.6.2 Phase Margin 

When the amplitude ratio is equal to 1 , the corresponding frequency cOg is called 
the gain crossover frequency 



\Goi{jujg)\ = l (5.79) 

and the phase angle is equal to (Fig. 5.23) 

4>g = ^rg{GolU^g)) (5-80) 

If (l)g is lower than — tt, the system is unstable; thus this phase angle will 
have to be larger than — tt. The difference 

PM = 7T—\(j)g\ (5.81) 

constitutes the phase margin. 

A recommended value of the phase margin is between 30 and 45°. 

Recall the tunings recommended by Ziegler and Nichols (Table 5.2) where 
Kcu is the ultimate gain and Tu the ultimate period 

T„ = 2tt/uj^ (5.82) 

corresponding to the sustained sinusoidal oscillation. 

Table 5.2: Tunings recommended by Ziegler and Nichols 



Controller 


Kc 


Tl 


Td 


P 


0.5 Kcu 






PI 


0.45 Kcu 


T„/l,2 




PID 


0.6 Kcu 


T„/2 


Tu/8 



The value O.bKcu corresponds, in fact, to a gain margin of 2, while 0.6 Kcu 
corresponds to a gain margin of 1.7. 

When the values recommended by Ziegler-Nichols are taken into account, 
the PI controller transfer function is equal, at the phase crossover frequency, to 

=0.45 Ke«(l =0.45 Ke«(l -0.191 j) (5.83) 

Z 7T 
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which corresponds to a phase lag of —10.8°, while for the PID controller it is 
equal to 

=0.6Ke«(l =0.6Ke«(l+0.467j) (5.84) 

Z 7T O 

which corresponds to a phase advance of 25°. 

A reproach often directed at Ziegler-Nichols tuning is that the corresponding 
controller realizes insufficient damping, so that later a more robust design must 
be performed. 

Example 5.1: Ziegler-Nichols Tuning 

Consider a very simple example of a hrst-order system with a time delay {Kp = 
2 ; r = 10 ; td = having the following transfer function 

= — C exp(-tds) (5.85) 

Ts + 1 

and calculate the gain and phase margins for different types of controllers (Table 
5.3) tuned by use of the Ziegler-Nichols recommendations. For all the cases, of 
course, the critical frequency uj(^ remains the same. The Bode criterion yields 
Kcu = 2.945 and uj(^ = 0.5805 rad/time unit. 



Table 5.3: Gain and phase margins for different types of controllers tuned 
according to the Ziegler-Nichols recommendations for a given first-order process 
with time delay 



Controller 


Kc 


Tl 


Td 


Gain 

margin 


Phase 
margin (°) 


CJg 


P 


Kqu 






1 


0 


0.5805 


P 


0-5 Kcu 






2 


62.2 


0.2770 


PI 


0A5Kcu 


T„/l,2 




2.18 


64.2 


0.2687 


real PID 


0.6 Kcu 


T„/2 


Tu/8 with 


1.47 


47.1 


0.3439 


(following) 






N = 20 
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Example 5.2: Chemical Reactor PID Control Using Ziegler-Nichols 
Tuning 



Feed 











<]l 



Outlet 



A continuous perfectly stirred chemical 
reactor with a nonlinear reaction is used 
to show the performances of the Ziegler- 
Nichols tuning for a PID. 



mical reactor is simply represented by the following equation 



^-^{CA0-CA)-kCl 



(5.86) 



The objective is to control the reactor concentration by manipulating the feed 
flow rate. The feed concentration is a disturbance. The parameters and steady- 
state conditions are given in Table 5.4. 



Table 5.4: Parameters and steady state (with superscript conditions of the 
CSTR 



Nominal flow rate of the feed 


Fq = 5 X 10“^ m^.s“^ 


Nominal concentration of reactant A in the feed 


C%Q = 3000 mol.m“^ 


Volume of reactor 


V = 0.1 m^ 


Kinetic constant 


k = 10“^ m^.mol“^.s“^ 


Steady-state concentration of reactant A 


= 363.1 mol.m“^ 



The volume of the reactor is assumed to be perfectly controlled. The nonlinear 
model is first linearized, resulting in the following Laplace equation 



Ca(5) 



s H — — — \- 2 k 



F{s)F 






F^ 

s H — — — \- 2 k 



Cao{s) 



(5.87) 



The actuator transfer and measurement functions are assumed to be, respecti- 
vely 






0.1 

55 + 1 



0.02 exp(— 5s) 
0.55+1 



(5.88) 



In the simulation, the process is represented by the nonlinear model while 
the PID tuning is strictly based on the linearized model: the critical frequency 
and the ultimate gain used in the Ziegler-Nichols table are drawn from the 
corresponding Bode plot of the open-loop transfer function. 

Three cases are studied: 
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1. The measurement delay is neglected and a real PID (Fig. 2.27) is used. 

2. The measurement delay is neglected and a real PID with anti- windup (Fig. 
4.13) is used. 

3. The measurement delay = 5 s is considered and the real PID of case 
1 is used. The results have shown that the improvement by adding an 
anti-windup is negligible in this particular case. 

The parameter N for the real PID is taken as N = 10. For the anti- windup, 
the time constant is = 2 and the inlet flow rate is assumed to be bound in 
the range [0 , 2 x 10“^]. 

The different characteristics and tunings are gathered in Table 5.5. 



Table 5.5: PID tuning by Ziegler-Nichols rule in the absence and in the 
presence of measurement delay 



Measurement d 
Critical frequency 
Ultimate gain 
PID Tuning by 
Kc = 0.0361 ; n = 


ielay neglected 
U(f) = 0.755 rad/s 
Kcu = 0.0602 
Ziegler-Nichols 
= 4.16 ; td = 1.04 


Measurement d( 
Critical frequency 
Ultimate gain 
PID Tuning by 
Kc = 0.0036 ; n = 


day considered 

= 0.206 rad/s 
Kcu = 0.0060 
Ziegler-Nichols 
15.26 ; td = 3.82 



These characteristics have been implemented and simulated on the “real” pro- 
cess. The process overcomes a step set point from the steady state to = 500 
mol.m“^ at time 0, then a step disturbance from the nominal value to Cao = 
3.5 X 10^ mol.m“^ at time 150. In the hrst case (Fig. 5.24), an overshoot 
corresponding to a saturation of the actuator occurs at the beginning, which 
disappears when the anti- windup scheme is used in the second case (Fig. 5.25). 
The overshoot becomes acceptable with use of anti- windup. The disturbance is 
well rejected. In the third case (Fig. 5.26), the real PID and the same PID with 
anti-windup give very close results and the overshoot is not very pronounced. 
However, the influence of the disturbance is more visible in this case of mea- 
surement delay and its rejection takes more time. 
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Figure 5.24: Response (left) and input (right) for the closed-loop process 
without delay with the tuning of the real PID controller obtained by the Ziegler- 
Nichols rule 





Figure 5.25: Response (left) and input (right) for the closed-loop process 
without delay with the tuning of the real PID with the anti- windup controller 
obtained by the Ziegler-Nichols rule 





Figure 5.26: Response (left) and input (right) for the closed-loop process with 
delay with the tuning of the real PID controller obtained by the Ziegler-Nichols 
rule 
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5.7 Nyquist Stability Criterion 

Recall that the Nyquist plot represents the imaginary part of the open-loop 
system transfer function with respect to its real part. In the present case 



Gol — Gc Ga Gp Gjn 


(5.89) 


thus its modulus is equal to 






\Goi{ju)\=AR=\G,\\Ga\ 


\Gp\ \Gm\ 


(5.90) 


and its argument 






arg(Goi(j^)) = 4>oi =4>c + 4>a 


+ + (prn 


(5.91) 



At the phase crossover frequency uj(^ and for a gain of the proportional controller 
Kc such that the amplitude ratio ARqi = 1, the open-loop system transfer 
function GqI verihes 

1 = \Gc\ \Ga\ \Gp\ \Gm\ (5.92) 

— 7T = (j)c (pa (pp (pm (5.93) 

In the Nyquist plot, the point of this system is then located at (-1,0). This 
point constitutes the basis of the Nyquist criterion. 

In general, GqI does not present unstable poles (in the right half-plane). 
The modulus \Gqi\ is proportional to the controller gain. When the controller 
gain becomes larger than the ultimate gain, the modulus of the Nyquist vector 
(representing Goi(jio)) increases and the curve “turns around” the point (—1,0) 
clockwise when the frequency increases; the system is then unstable. 





Figure 5.27: Illustration of Cauchy’s theorem 

The Nyquist criterion relies on the following property of complex functions 
(Cauchy’s theorem). 




214 



Chapter 5. Frequency Analysis 



Consider a function F{s) of complex variable 5, a closed contour C of the 
complex plane (Fig. 5.27), and assume that F{s) possesses Z zeros and P 
poles inside the contour C. When 5 covers the closed contour C, the function 
F{s) describes a closed trajectory T. The algebraic number N of encirclings 
(counted positively if they are in the same way as 5 covers C, negatively in 
the opposite way) from origin O realized by the function F{s) is equal to the 
difference Z — P: 

N = Z-P (5.94) 




Figure 5.28: Illustration of Nyquist criterion 

To get the Nyquist criterion, it suffices to choose the function F{s) = 1 + 
Goi{s)^ or by horizontal translation to choose the function Goi{s) with the point 
(—1,0) that replaces the origin. The point (—1,0) is often called the Nyquist 
point. For the closed-loop transfer function to be stable, 1 + Goi{s) must not 
present positive real part roots, thus Z = 0. The contour C, called the Nyquist 
contour, must then contain all the right half-plane (Fig. 5.28) and is symbolized 
by a half-circle of infinite radius, centred at the origin, covered clockwise. If 
one or several of the poles of Goi{s) are present on the imaginary axis, the 
contour C turns around them so as not to include them. The algebraic number 
N of encirclings (counted positively if they are in the same way as s covers C, 
negatively in the opposite way) of the trajectory T of Goi{joo) around the point 
(—1,0) must then be opposite to the number of poles of Goi{joo) inside C, thus 
the right half-plane: N = —P. 

The Nyquist criterion can thus be expressed: 

The closed-loop system of transfer function 




Process Control 



215 



Gel 



Got 

1 + Goi 



is stable if and only if the representative curve of the open-loop 
system GqI encircles the point (—1,0), when the frequency varies from 
— oo to + 00 , as many times anti-clockwise as GqI possesses poles in the 
right half-plane (provided there do not exist hidden unstable modes) . 

When the frequency varies from — oo to +oo, the locus of GqI is closed. If 
Goi has no poles in the right half-plane, thus is stable, and if the locus of GqI 
does not encircle the point (—1,0), then the closed-loop system is stable. On 
the contrary, if the locus of GqI encircles the point (—1,0), then the closed-loop 
system is unstable. 




Figure 5.29: Gain and phase margins on the Nyquist plot 

The gain margin and the phase margin can be represented on the Nyquist 
plot, as well as phase and gain crossover frequencies. In the simple case of Fig. 
5.29, the gain margin GM is obtained from the intersection of the curve Goi{joo) 
with the real axis, the frequency is then the phase crossover frequency uj(^ such 
that argGo/ ) = — tt. The phase margin is obtained from the intersection of 
the curve Goi{joo) with the circle of unit radius centred at the origin (Fig. 5.29), 
the frequency is then the gain crossover frequency ujg such that \Goi{joOg)\ = 1. 
The case where there exist several intersections is discussed in Sect. 5.10. 



5.8 Closed-Loop Frequency Response 

When the closed-loop system is subjected to a set point variation the out- 
put y must present a certain number of characteristics that can be partially 
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explained (Fig. 5.30) by using the closed-loop frequency response of 
Let ARci be the closed-loop amplitude ratio and ip the phase angle 

= (5.95) 




Figure 5.30: Closed-loop desired amplitude ratio for a set point variation 

• At low frequencies {u 0), ARd must be equal to 1, which means that 
there is no asymptotic deviation of the output with respect to the set point 
(refer to the final value theorem). 

• ARci must be maintained close to 1 for frequencies as high as possible in 
order to reach rapidly the new steady state during a set point variation 
(refer to the initial value theorem). 

• It is often desired that the system presents a resonance peak similar to that 
of a second-order system for which minimum ( = 0.5. This corresponds to 
a maximum amplitude ratio at the peak ARd^p = 1.25. The frequency cOp 
must be as high as possible; this is in agreement with the previous point. 

• The bandwidth is the frequency for which ARd = \/2/2 ~ 0.7. If the 
bandwidth is large, the response will be fast with a short rising time. 

In a general manner, the ratio Y/Y^ is expressed as 

y(£)_ ^ G,is)Gais)Gp{s)Km , . 

T^(5) 1 Gc{s)Ga{s)Gp{s)Gm{s) 

and in the simple case where the measurement transfer function is a simple 
gain 



Gm — Kjn 



(5.98) 
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this ratio Y jYr is easily expressed with respect to the open-loop transfer func- 
tion as 



y(g) G,i{s) 

1 Goi{s) 



(5.99) 



with Goi{s) — GcGaGpKjji 

The closed-loop frequency response for set point variations (Fig. 5.31) in 
the Bode representation depicts the frequency influence on the ratio Y jY^. In a 
logarithmic representation, Fig. 5.30 can be found to again have a slight bump 
around 2 rad/s. 




Figure 5.31: Frequency characteristics of the closed-loop second-order process 
response for set point variations {Kp = 2 , r = 5 , (“ = 0.5), plus a real PID 
controller {Kc = 10 , r/ = 5 , ri:) = 4 , /3 = 0.1) 



From the previous equation, the closed-loop amplitude ratio results 
ARcI = ^ 



^/[l + {cos (f>oi/ARoi)f + {sin (f>oi/ARoi)^ 
and the closed-loop phase angle 

sin (f>oi/ARoi \ 



(5.100) 



^0 = arctang 



1 + cos (l)oi/ARoi ) 



(5.101) 



where ARqi and c^oi are the amplitude ratio and phase angle respectively for 
the open-loop system transfer function. 

The Nichols plot is, in fact, a representation in Black coordinates. For given 
values of the amplitude ratio and of the phase angle of the open-loop system, it 
allows us to determine by interpolation the closed-loop values ARd and ip. The 
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+/ 



Goi 



Y{s) 



Figure 5.32: Closed-loop unity feedback system 



Nichols plot has been built with a unity feedback (Fig. 5.32); this means that 
the only transfer function of the direct loop is the open-loop process transfer 
function. In the case where the measurement presents a transfer function (Fig. 
5.33), it is necessary to slightly modify this scheme by introducing it into the 
direct loop and multiplying by the reciprocal of the measurement transfer func- 
tion before the output (Fig. 5.33). It is possible to check that the closed-loop 
transfer function is the same for both block diagrams (a) and (b) of Fig. 5.33. 



Y = 



GKrr 



l^GGr 



Yr 



(5.102) 




Figure 5.33: Closed-loop system with measurement transfer function (a) and 
equivalent system with unity feedback (b) 

The Nichols plot (Fig. 5.34) considers as abscissa the open-loop phase angle 
(^oi of Goi and as ordinate the open-loop amplitude ratio of GqI^ Two families 
of curves on the graph represent the curves at constant amplitude ratio ARd 
(here in dB), and at constant phase angle ip. 

The bold curve is the locus of the points of abscissa poi and ordinate ARqi = 
\Goi\ when the frequency varies from 0.01 (rectangle at the extremity of the 
curve) to 100 rad/s for a second-order system controlled by a real PID. 

At low frequencies, the bold curve is included between curves — IdB and 
IdB; —IdB corresponds to ARd =0.89 and IdB corresponds to ARd = 1.12, 
the excursion for AR^j^ < 0 being very limited. The recommendations 
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Figure 5.34: Nichols plot of a second-order system {Kp = 2 , r = 5 , (“ = 0.5) 
and a real PID controller (Kc = 10 , r/ = 5 , ri:) = 4 , /3 = 0.1) 



1 < ARci <1.25 are thus relatively well respected. Notice that the maximum 
of ARci would be denoted by ||T||oo in a robustness study (cf. Sect. 5.10). 




Figure 5.35: Response to a set point step of a second-order system {Kp = 
2 , r = 5 , C = 0-5) and a real PID controller {Kc = 10 ^ rj = 5 ^ td = 4 , /3 = 

0 . 1 ) 

Then the response to a set point variation is studied. The response to 
a set point step of this second-order system controlled by a real PID (Fig. 
5.35) presents suitable characteristics with respect to the rising time and the 
asymptotic deviation. The overshoot may nevertheless seem too large. 

What has been done for the set point can also be done for the disturbance; 
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it is, however, necessary frequently to verify that an optimal tuning for the set 
point is no more optimal for the disturbance and vice versa. 

Again, in the case of the second-order process controlled by a real PID, 
the ratio Y jD was studied showing the influence of the disturbance. It was 
assumed that the transfer function between disturbance and output is equal 
to that of the process G^. The plot of the closed-loop frequency response for 
disturbance variations was thus drawn (Fig. 5.36). It is noticed that the gain 
remains always lower than 1; the aim \ Y/D\ <C 1 must be reached in the largest 
possible frequency range. 





Figure 5.36: Closed-loop frequency response for disturbance variations of a 
second-order process {Kp = 2,r = 5,(“ = 0.5) plus a real PID controller 
(iCe = 10,rj = 5,ri,=4,/3 = 0.1) 



5.9 Internal Model Principle 



The internal model principle is a recommendation of general interest, usable 
for any controller synthesis. 

According to the diagram of Fig. 5.37, the output is equal to 



y(s) 



G(s) Gc{s) 

1 + G(s) Gc{s) 



Yr{s) + 



1 

1 + G(s) Gc{s) 



D{s) 



The closed-loop frequency response is thus 



(5.103) 



G{juj) Gc(juj) 

1 + G{juj) Gc{juj) 



Yr{jiu) + 



1 






1 + G{juj) Gc(juj) 



D{juj) (5.104) 
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Process 



r-(s) 

+ 



E{s) 



Gc{s) 



Ua{s) 



G{s) 



+ 



+ 



D{s) 



Y{s) 



Figure 5.37: Feedback control with disturbance 



So that the output correctly follows the desired output, it is sufficient that the 
open-loop gain G(juj) Gc(juj) is sufficiently large in the dominant frequency 
domain of the set point Yr{jco) and the disturbance D{ju). According to the 
internal model principle by Francis and Wonham (1976), it is sufficient to place 
the modes corresponding to these frequencies as well as to high frequencies 
in the polynomial denominator of the controller Gc{s). If the set point or 
the disturbance is constant, the dominant frequency is equal to zero, and an 
integrator 1/s must be introduced in the controller, thus s in its denominator. 
In a general manner, Wohnam (1985) mentions that a controller is structurally 
stable only if the controller uses the feedback of the controlled variable, and 
incorporates in the feedback loop a correctly duplicated model of the dynamic 
structure of the exogenous signals (of external source) that the controller must 
treat. As an image form, (Wohnam, 1985) says that any good controller must 
incorporate a model of the external world. 



5.10 Robustness 

The notion of robustness is related to uncertainties dealing either with the 
process itself or the process environment, such as unmodelled disturbances or 
ill-considered process dynamics, frequently at high frequency, in the case where 
a control model is used. If possible, a control system must be not very sensitive 
to these factors. However, it must be noted that a compromise performance- 
robustness must be searched. Clearly, robustness is essential in an industrial 
context (Larminat, 1991). 

A possibility (Morari and Zahriou, 1989) is to quantify the relative error on 
the model by 

(5.105) 

Gr 

where G is the actual process model (unknown by dehnition) and G the used 
available model. So that the controlled process is stable, it is necessary that 

rnax|em| < — (5.106) 

ARci,p 

ARci^p being the value of ARd at the resonance peak. 
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To get a given value of ARd^p for the controller, the gain and phase margins 
GM and PM must satisfy 



GM > 1 + 



1 

ARqI^p 



(5.107) 



PM > 2 arcsin( 1/2 (5.108) 

Thus, to get ARci^p = 1.25 as previously recommended, this gives GM = 1.8 
and PM = 0.823 rad = 47°. 

Actually, there exist more formal approaches, such as the H^o theory, based 
on the use of norm and a frequency study (Doyle et ah, 1992; Kwakernaak, 
1993; Morari and Zafiriou, 1989; Oustaloup, 1994). A brief overview will be 
given in this section, in order to show all the interest in this theory and to open 
new horizons. 

First, define the usual norms for signals: 

2-norm or Euclidean norm of the signal x{t) (of finite energy) 



lkl|2 




oo-norm of the signal x{t) 



||x||oo = max|x(t)| 



(5.109) 



(5.110) 



For a signal x{t) of finite mean power, the square root of the power is dehned 



lim 

T -^oo 




1/2 



(5.111) 



Then, similarly, the norms are defined for a system transfer function (with 
respect to frequency co) 

2 -norm of G{s) 

/ 1 /‘oo \ 1/2 

I|G||2=(^ J JG{jd\^du;j (5.112) 

oo-norm of G{s) 

||G||oo = max|G(ja;)| (5.113) 

(jJ 

Thus, the infinity norm of G: ||(T||oo represents the maximum amplitude of G 
in the Bode plot. 

Consider the general block diagram (Fig. 5.38) for robustness study. d 2 is a 
measured disturbance infiuencing the output y, so that a feedforward Gff is 
used (cf. Sect. 6.6.3). di is an unmeasured disturbance and 77 represents a 
noise that infiuences measurement. F{s) is a prefilter modifying the set point. 

From the open-loop transfer function GqI = G P, two new functions are 
defined which allow us to characterize the system: 
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Figure 5.38: Closed loop system considered for robustness study 



The sensitivity function S 



S{s) = 



E{s) 



1 



T^*(5) 1 Gol{s) 

• The complementary sensitivity function T 

T(.) = 

which is related to the closed-loop transfer function by 
Gci{s) = ^=T{s)F{s) 



(5.114) 



(5.115) 



(5.116) 



Assuming that the disturbance (I 2 and the associated feedforward controller 
do not exist, notice that the sensitivity function would also be equal to 



S{s) = - 



m 

Di{s) 



(5.117) 



and thus represents the sensitivity of the system to a disturbance di, as e 
represents the error. The comparison of both expressions of the sensitivity 
function shows that the set point and the disturbance act out the same role. 

The sensitivity and complementary sensitivity functions depend only on the 
transfer functions inside the loop and always verify 



S{s) +T(5) = 1 (5.118) 

From scheme 5.38, the output is equal to 

y (s) = T{s) F{s) y^s) + S{s) Di (s) + [1 + G//(s) P{s)\ S{s) D 2 (s) - T{s) N{s) 

(5.119) 

It clearly appears, from this equation, that if the influence of the disturbance 
is to be limited, S{s) must be small and if the influence of the measurement 
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noise is to be limited, T{s) must be small. A compromise must then be found 
between the disturbance attenuation and the filtering of measurement errors. 
The following relation 



AT/T dT P 

lim = 

AP^o AP/P dP T 



(5.120) 



which presents a limited mathematical interest, is rich at the physical level. 
Indeed, the left member represents the ratio of a relative variation of the com- 
plementary sensitivity function T over a relative variation of transfer function 
P (uncertainty in the process). On the other hand, using the definitions of S 
and T, it can be shown that the right member is equal to the sensitivity func- 
tion S. The conclusion is that the sensitivity function measures the sensitivity 
of the closed-loop transfer function to a process variation. 

A criterion of tracking performance^ could be to postulate that the user’s 
wish is 

Halloo <e (5.121) 

in order to minimize the tracking error, the amplitude of which would be lower 
than e for a set point of amplitude lower or equal to 1. 

To limit the influence of noise 77, we should have^ 



||T||oo <e (5.122) 

The same condition is obtained to limit the influence of the disturbance. Clearly, 
the objectives of tracking and disturbance rejection on one hand, of noise sup- 
pression on the other are contradictory. 

It is also possible to filter the set point, e.g. by a lowpass or bandpass filter 
of transmittance Wi so that the previous equation would become 



||f^i^||oo<l (5.123) 

where Wi represents a frequency weighting function. Equation (5.123) is the 
condition of nominal performance. Notice that this equation can also be written 
in the form 

which means that, in the Bode plot, the sensitivity curve is located below a 
prescribed curve for any frequency. 

^In the multivariable case, we impose 

<^max|*S^(ja;) I << 1 

where <Jmax is the largest singular value of S{juu). 

^In the multivariable case, we would have 



<^max|F(ju;) I << 1 
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Figure 5.39: Nyquist locus for the open-loop transfer function with the gain 
margin and the phase margin 



From the dehnition of S, the previous inequality can also be transformed 
as 



W,{ju) 
1 + Goi {j 



< 1 









(5.125) 

thus, in the Nyquist plot, the locus of Goi{j cj) is situated entirely outside the 
disc of centre —1, radius \Wi{j io)\ (Fig. 5.39), which corresponds to a more 
restrictive vision of the traditional Nyquist criterion. In this hgure, the gain 
margin can be dehned as the open interval which can be interpreted in 

the following manner by denoting by P the model of the process P which thus 
represents an approximation of P, for any g taken in the interval the 

controller of gain G stabilizes the process P when P is equal to g P. 

Taking the intersection of the locus of Goi{j cj) and of the circle of centre O 
and unit radius, the point of the gain crossover frequency cOg is obtained such 
that \Goi{jiOg)\ = 1 , and the angle thus formed with the abscissa axis is the 
phase margin (j). If there exist several intersections, the phase margin is the 
minimum of the formed angles. 

The delay margin is dehned from the gain margin and the gain crossover 
frequency cOg as 



Md = — (5.126) 

CJg 

The delay margin corresponds to the largest delay 6 such that the controller of 
gain G stabilizes the process P when P is of the form exp (— 6 ^ 5 ) P. 

In fact, these margins can reveal themselves to be insufficient, and the 
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distance from point —1 to the locus of GqI equal to 



1 



(5.127) 



is considered. It means that a model error, which was not taken into account 
by the gain and phase margins must be larger than a certain value so that the 
process is unstabilized with the chosen controller of gain C. 

Now introduce the model error such that the nominal transfer function is 
P and the modified or perturbed function is in the form 



P=(1 + AVF2)P (5.128) 

thus AVF 2 represents the relative error on the process, with A being a stable 
transfer function such that its norm || A||oo < 1, and W 2 is also a stable transfer 
function, which then verifies 



p{j^) 

P{3^) 









(5.129) 



Hence, \W 2 {j cj)| defines an uncertainty profile such that the point P / P is in a 
disc of centre 1, radius IVF 2 I. Note that this radius corresponding to uncertainty 
increases in general with frequency u. 

The robust stability is defined as the property that the closed-loop system 
is stable with a given controller C when the process P is situated in a ball V. 
Then consider the open-loop perturbed function 

PC = (1 + AVF 2 ) PC = (1 + AVF 2 ) Goi (5.130) 

which will be studied in the Nyquist plane. This function must not encircle the 
point of abscissa —1, thus consider the quantity 



1 + PC = 1 + (1 + AVF 2 )Co/ = (l + Co/)(l + AVF 2 T) (5.131) 

The condition of robust stability for multiplicative uncertainty can be formu- 
lated as 

||1H2T||oo<1 (5.132) 

and the stability margin is the reciprocal l/|| IF 2 T||oo- H results that 

||A1T2T||oo< ||W2T||oo<1 (5.133) 



Equation (5.132) is written again 



W2{juj) Goijjuj) 

1 + Goi {jio) 



< I Vcj 



\W2{juj)Goi{juj)\ < \lPGoi{juj)\ 



(5.134) 

which can be expressed graphically (Fig. 5.40) by the fact that the critical 
point —1 is always located outside the uncertainty discs of centre Goi{joo) and 
radius | W 2 (ju;) Goi {juj) \ . 
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Figure 5.40: Nyquist locus for the open-loop transfer function and uncertainty 
discs 

A system should simultaneously present a good performance and good sta- 
bility. For this reason, the conditions of the equation of nominal performance 
(5.123) and of the equation of robust stability (5.132) are gathered under the 
condition of robust performance 

II |1Fi^| + |1F2T| ||oo<1 (5.135) 

which allows us to simultaneously guarantee both previous conditions. Again, 
a graphical representation can be achieved in the Nyquist plane (Fig. 5.41). 
The discs centred in —1 and in Goi{joo) respectively must then be disconnected 
to guarantee condition (5.135). 




Figure 5.41: Nyquist locus for the open-loop transfer function with the per- 
formance and stability discs: these two discs must be disconnected to guarantee 
robust stability 

These considerations for robustness hnd their application in the design of 
optimal robust controllers (Doyle et ah, 1992; Oustaloup, 1994), which minimize 




228 



Chapter 5. Frequency Analysis 



the following criterion deduced from Eq. (5.135) 

C{C) = \\ {\WiS\^ + \W2T\^y^^ (5.136) 

Let us cite some general ideas and recommendations 

• The spectra of signals of reference and disturbances (factor of S{s) in Eq. 
(5.119)) are concentrated in general around low frequencies and the spec- 
trum of measurement errors (factor of T{s) in Eq. (5.119)) covers a larger 
frequency domain. 

• In a simplified manner, the wish is to obtain small \S{juj)\ at low frequencies 
(in the process passband) and small \T{juu)\ at high frequencies (in the 
process stopband). 

• The stability margin is frequently estimated by calculating the maximum 
of the modulus of the sensitivity function max(|5'(jcj)|) or of the comple- 
mentary sensitivity function max(|T(jcj)|). Thus, the modulus margin is 
introduced, for which De Larminat (1993) recommends 

max|T(jcj)| < 1.3 (5.137) 

(jj 

which is, in general, less conservative than Eq. (5.132). 

• In a very similar way to Eig. 5.30, the complementary sensitivity function 
T{ju) is close to 1 at low frequencies, presents a peak for a resonance fre- 
quency, and then decreases at high frequencies. T should decrease quickly 
at high frequencies in order to diminish the infiuence of model errors in this 
frequency domain. 

• The sensitivity function S{ju) is small at low frequencies to protect from 
model errors in this frequency domain, and tends towards 1 at high frequen- 
cies. If possible, S should not present a peak in the transition region. 

• Control IX, according to Eig. 5.38, is equal to 

(5.138) 

The amplitude of process inputs is physically limited. When the open-loop 
gain I (To/ 1 = | (TP | is large with respect to 1, the ratio (T/(1+(TP) isnearl/P 
and represents the sensitivity of the input with respect to the disturbances 
and to the set point, which could be denoted by Su- In comparison with the 
complementary sensitivity function T, the relation T = SuP obtained. 
As the input u is to be limited, the sensitivity Su should not be too high, 
which is realized by imposing a large open-loop gain at low frequency and 
by making Su decrease rapidly at high frequency. 

• The output, following Eig. 5.38, is equal to 

y = ^ ^cp {yr-v) + -^^^p d = T{yr-iq)+Siq (5.139) 

The measurement noise 77 thus infiuences the output through the comple- 
mentary sensitivity function T, hence the importance of the decrease of 
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T{ju) at high frequencies. The influence of the set point is also realized 
through the complementary sensitivity function at low frequencies. To avoid 
too large inputs, it may be necessary to hlter the set point, which reduces 
the passband. 

• In the Bode plot, the open-loop transfer function must respect a certain 
frequency specihcation performance-robustness. At low frequency, the con- 
dition of nominal performance (5.123) prevails, while at high frequency, this 
will be the condition of robust stability (5.132). 

• A necessary condition of robust performance is that 

min|TCi(ja;)|, \W 2 {juj)\ < 1 (5.140) 



which implies that at least one of the two functions Wi or W 2 has a mo- 
dulus lower than 1. In practice, on the one hand, \Wi{jio)\ is a decreasing 
function with respect to co dominating at low frequency for the tracking of 
low frequency signals, on the other, \W 2 {jio)\ is an increasing function with 
respect to cj, dominating at high frequency as well as the uncertainty. 
Design of a controller C: 

The controller must satisfy the condition of robust performance (5.135). A 
possibility is to build the function GqI so that this condition is verihed, then to 
calculate the controller as (T = Gqi! P- Of course, G must be proper. Moreover, 
if P or P~^ is not stable, GqI must contain the unstable poles. With respect 
to Goh fhe condition of robust performance is written as 



Wi 


1 


IV 2 


1 + 




1 + Gol 






(5.141) 



while respecting Eq. (5.140). 

• At low frequency, | Wi | ^ 1 > | W 2 \ , which gives 



\Goi \ > 



1^1 1 

1-IW2I 



or 



1^1 1 

1 - 1W2I 



1^1 <1 



(5.142) 



This condition of robust performance is stronger than the condition of nom- 
inal performance |VEi/S| < 1. 

• At high frequency, IVE 2 I ^ 1 > |VEi|, which gives 



\Goi \ < 



|fD2| 



or 



1 ^ 2 ! 



|T| <1 



(5.143) 



This condition of robust performance is stronger than the condition of robust 
stability IVE 2 TI < 1. 

The design of GqI according to the frequency specihcation can then be per- 
formed in the following manner. In the Bode plot for the modulus, the curve 
of (|VEi|)/(l — IVE 2 I) is drawn in the low-frequency domain and the curve of 
(1 — |VEi|)/(| 1T2|) in the high-frequency domain. In fact, given the characte- 
ristics of Wi and VE 2 , it is practically equivalent to simply consider the curve 
of (|VEi|) at low frequency and the curve of 1/(|1T2|) at high frequency. The 
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curve of GqI must be situated above the first curve at low frequency, and below 
the second one at high frequency. Moreover, so that the controller transfer 
function C is proper, \Goi\ must decrease at least as fast as \P\ at high fre- 
quency. Lastly, the transition from low to high frequency must be smooth with 
a small slope when the modulus is near 1 (gain crossover frequency). Doyle 
et al. (1992) recommend a maximum slope equal to 2, and that at the gain 
crossover frequency, the argument of GqI is larger than —it. A trial and error 
procedure will often be necessary. 

Example 5.3: Design of a Robust Controller for Delay Compensation 

Design a controller G that allows us to stabilize a first-order process presenting 
an unknown time delay. 

Consider a first-order process, with time delay, of the transfer function 

P = — 1— exp{-tds) = P exp{-tds) = P (1 + AW 2 ) (5.144) 

r 5 + 1 

where P represents the first-order process without time delay, thus is non- 
perturbed. Suppose that the unknown time delay td is included between 0 
and a maximum time delay tdm- Numerical values are r = 10, tdm = 4. The 
uncertainty associated with the time delay is considerable compared to the 
process time constant. The design of the controller will be realized by a series 
of trial and error calculations. 




Figure 5.42: Relative uncertainty (continuous curve) and its approximation 
(‘o’) for the first-order process (r = 10) with unknown time delay {tdm = 4) 



The relative uncertainty associated with the process (Fig. 5.42) is equal to 

AW2P 



Replacing s hj jco gives 

AW2P 



= exp{-tds) - 1 



= -2jsin(ho;) exp(-jho;) 



(5.145) 



P 



(5.146) 
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having the following modulus 



AW2P ^ . 

p = 2 sin(-cj) 



(5.147) 



In fact, the time delay td is unknown, but is itself bounded by tdm so that an 
approximation of the bound of the relative uncertainty is 



AW2P . ^tdm ^ 

< 2 sm( io) 

P - ^ 2 ^ 

from which the frequency weighting concerning robustness is deduced 
f W2UU1) = 2 sin(^a;) ifw < ^ 

< ^ ^dm 



(5.148) 



W2{jui) = 2 



if cj > 



The higher limit of the bound is reached at frequency when sin(^^cj) = 1 . 
On the other hand, the uncertainty is equal to 100 % when A = 1 , which gives 
the maximum bandwidth [0, 3 ^^] ~ [0, Using a Fade approximation of 

order 1, the relative uncertainty is approximated (Fig. 5.42) by 



AW 2 P _ tdS 
P ^ Td “ 



(5.150) 



so that we choose 



i^s + l 



(5.151) 



By setting GqI = PC ^ C being the controller transfer function, the following 
relation will have to be checked in the high-frequency domain 



\Goi{j^)\ < 



\W2{M 



The transition for GqI is located in the frequency range [ 3 ^^, Hd ^ td 

= tF’ ""2 = 

We begin the calculation by setting the open-loop transfer function 



Goi — 



cs -\-l 



(5.153) 



b should be as large as possible to minimize the tracking error. We take c = tdm 
so that Goi begins to decrease at We also desire that \Gqi\ < 1 . As IVF 2 I 
is equal to 1 for cj = 2/{y^tdm) = 0.289, it gives b = 1.53. Note that this 
constraint is more severe than \G 0 i\uj 2 < U 

On the other hand, the frequency weighting relative to performance is cho- 



I VFi| = a liuj < 
I VF 1 I 0 if cj > 



(5.154) 
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From the inequality valid at low frequencies 



\Goi\ > 



1^1 1 

1-IW2I 



Vcj < cji 



it results that 
which gives a = 0.316. 



(5.155) 

(5.156) 




Figure 5.43: Response to a set point step of the perturbed system for different 
values of the time delay (0, 2 and 4) and the first controller 



For the first-order process chosen, fixing the time delay (0, 2 and 4), it is then 
possible to simulate the closed-loop response to a set point step (Fig. 5.43). 
As the controller equal to 



C = 



1.53(105+1) 

45+1 



(5.157) 



presents no integral action, the deviation which existed with respect to the 
set point has been compensated for by the introduction of a pure gain equal 
to 1.654 between the set point and the comparator. For the largest value of 
the time delay, the response presents a relatively large overshoot. Thus, this 
constitutes the first trial with the following numerical value of the open-loop 
transfer function GqI 

1 53 

G. = ^ (5.158) 

We wish to increase \Goi \ on the passband [0, by modifying GqI as 



Goi 



b 5 + 5 
45+1 5 + 1 



(5.159) 



The same method as previously gives b = 0.42. The weighting W 2 is unchanged, 
as well as Wi = a. If the value b = 0.42 is kept, the response is more unstable 
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Figure 5.44: Response to a set point step of the perturbed system for different 
values of the time delay (0, 2 and 4) and the second controller 



than in the previous case. For that reason, b has been modified arbitrarily tob = 
0.21, which allows the correct responses as in Fig. 5.44. The precompensation 
gain is equal to 1.95. 

This constitutes the second trial, with the following controller transfer func- 
tion 



C = 



0,21(105 + 1) 5 + 5 
45 + 1 5 + 1 



(5.160) 



The waited improvement with respect to the previous controller, which resulted 
from the simple application of robustness rules, is not evident. 

For the third calculated controller, the choice is made to provide it with an 
integral action so that its transfer function is 



5(105 + 1 ) 5 + 5 
5 5 + 1 

and the open-loop transfer function is 



Goi 



5 5 + 5 
5 5 + 1 



(5.161) 



(5.162) 



The calculation then gives 5 = 0.08 and Wi = a = 0.338. With this value, 
oscillations are strong, so 5 is reduced to 5 = 0.04. With this value, responses 
are quite acceptable (Fig. 5.45), and it is not necessary anymore to set a 
precompensation gain. This constitutes the third trial. 

By filtering the set point by a first-order transfer function equal to 1/(105 + 
1), it is possible to absorb much larger time delays than tdm- Thus, the response 
of Fig. 5.46 was obtained for 5^^ = 6 at the expense of a sluggish oscillation 
but with a smaller overshoot than the response obtained with td = ^ and 
precompensations by only pure gains. 

Figure 5.47 shows that in these conditions, the modulus \Gqi\ of the open- 
loop transfer function is larger than |VFi| at low frequencies respecting the 
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Figure 5.45: Response to a set point step of the perturbed system for different 
values of the time delay (0, 2 and 4) and the third controller with integral action 




O 10 20 30 40 50 60 70 80 90 1 0O 



Figure 5.46: Response to a set point step of the perturbed system for a time 
delay equal to 8 and a filtering of the set point by a first-order 



nominal performance, and is smaller than I/IVF 2 I at high frequencies respecting 
the robust stability. The set |VFi| and I/IVF 2 I gives the frequency specification 
to be respected. Moreover, the sum |VFi/S| + IVF 2 TI relative to the condition of 
robust performance is lower than 1 on all the frequency range. The robustness 
criteria are thus respected. 

If the behaviour of the sensitivity function S and of the complementary 
sensitivity function T (Fig. 5.48) is examined, it shows that the modulus of 
T tends towards 1 at low frequencies. At high frequencies, the modulus of S 
tends towards 1 as foreseen. In the intermediate frequency range, the modulus 
of S shows a small bump while \T\ decreases continuously. It would be better 
that \T\ presents a slight bump at the resonance frequency. 

Doyle et al. (1992) frequently recommend using 



|l^l| = 0.5 (|P|2 + 1)1/2 ^ ^ (|p|-2 ^ ^)l/2 



(5.163) 
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Figure 5.47: Modulus |VPi| of the weighting function relative to nominal per- 
formance ( ). Reciprocal I/IVP 2 I of the modulus of the weighting function 

relative to robust stability (---). Modulus \Gqi \ of the open-loop transfer func- 
tion (continuous curve). Sum of moduli |VPi/S| + |Vf 2 ^| for robust performance 

(...) 




Figure 5.48: Modulus of sensitivity function S (- -) and of complementary 
sensitivity function T (continuous curve) 



In the present case, it gives 



P = 



K 

rs 1 



\Wi\ w0.5(K2 + 1)1/2 ^ 
IW2I W 0.5^0; , 



Vcj > CJ2 



(5.164) 



The new weightings have been drawn in Fig. 5.49 which must be compared 
to Fig. 5.47. It shows that the modulus |VFi| is increased and the allure 
of I/IVF 2 I is relatively different and imposes a more severe constraint. The 
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Figure 5.49: Application of recommendations of Doyle et al. (1992). Modulus 
I Wi I of the weighting function relative to nominal performance ( — ) . Recipro- 
cal I/IVF 2 I of the modulus of the weighting function relative to robust stability 
(---). Modulus \Goi\ of the open-loop transfer function (continuous curve). 
Sum of moduli |VFi/S| + IVF 2 TI for robust performance (. . .) 



frequency specification is still respected by the same open-loop transfer function 
as previously, thus the same controller. 

Kwakernaak in (Oustaloup, 1994), setting L = GqI and L~^ = G~^ ^ recom- 
mends choosing the following weightings 



\W2{JUJ)\ 



0 

0 



at low frequencies 
at high frequencies 
at low frequencies 
at high frequencies 



(5.165) 



where functions \ Wl\ and |VfL-i| respectively represent bounds of relative un- 
certainties on the open-loop transfer function and its reciprocal. 

Taking the controller (7 = 1, one obtains G~^ = (1 + rs) exp(t^^5), which 
gives at low frequencies 

Wl-i{s) = l-exp{-tds) Wi{juj) ^ jtdu; (5.166) 

while, at high frequencies, the same expression as at the beginning of this 
example is obtained 

Wl{s) = exp(-tds) -1 W2{juj) = exp{-jtduj) - I (5.167) 

The weighting Wi reaches its maximum at low frequencies (domain [0, l/tm]) 
in cji = Ijidm = 0.25, that gives |VFi| = 1, which is not a very large value, 
neither is it very different from previously found values. 
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5.11 Summary for Controller Design 

Maciejowski (1989) gives an excellent description of a controller design. Refer 
to Fig. 5.38 for notations. 

In a simplified way, the desire is that the sensitivity function \S{jio)\, which 
intervenes as a factor of disturbances or of parameter uncertainty is small at low 
frequencies {ou < oUq) and the complementary sensitivity function |T(jcj)|, which 
intervenes as a factor of the measurement noise is small at high frequencies 
(cj > CJ 5 ). The design results from a compromise. 

The classical rule is that to make \S{s)\ small, |1 + (C( 5 )P( 5 )| should be 
large, thus if |P( 5 )| is not large, a large controller gain |C( 5 )| is necessary 
at low frequencies. Similarly, to make \T{s)\ small, |1 + (C( 5 )P( 5 )| should be 
near zero; it may be necessary that the controller gain |C( 5 )| is small at high 
frequencies, when |P( 5 )| does not decrease sufficiently fast at high frequencies. 

The frequency is defined taking into account the spectrum of the dis- 
turbance, as 

\S{jcj)\ <e , < cjo (5.168) 

The frequency is the passband of the loop and is the smallest frequency such 
that 

\T{jLOb)\=T{0)/V2 (5.169) 

It is larger than lOq- The passband is approximately reciprocal with respect 
to the response time of the output y{t) to a disturbance step. Thus, it is an 
indicator of the speed of rejection of a disturbance. On the other hand, the 
passband is frequently very near to the gain crossover frequency cOg and is 
often such that 

^4> ^ ^ (5.170) 

The transmission band equal to the passband of F{s)T{s) indicates the 
speed of the response to the set point. 

The internal model principle by Francis and Wonham (1976) must be used. 
It can be expressed in frequency terms such that 

|P(0)O(0)| = 00 (5.171) 

to ensure a zero steady-state error facing step disturbances, or 

lim cj\P(jcj)C(jcj)\ = 00 (5.172) 

to ensure a zero steady-state error facing ramp disturbances. 

The loop stability must be considered, an excessive phase lag must be 
avoided when 

\P{jLo)C{jLo)\=l (5.173) 

because of the Nyquist criterion, which limits the action on the loop gain. Near 
the gain crossover frequency, the speed of decrease of the open-loop transfer 
function, for a minimum-phase system, must be lower than 40dB/decade. 
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It is better to design a stable open-loop controller, although this is not 
essential. 

The existence of unstable zeros or poles in process P{s) diminishes the 
efficiency of the output feedback by decreasing the frequency domain on which 
it is efficient. 

For a stable open-loop system, the Bode criterion gives 

oo 

\og{\S{jio)\dio = 0 (5.174) 

In fact, if the sensitivity decreases more strongly in a domain of low frequencies, 
a sensitivity peak results at higher frequencies. 
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Chapter 6 

Improvement of Control 
Systems 



Three advanced control systems have been presented in Chap. 4: internal model 
control, pole-placement and linear quadratic control. These control systems are 
essentially implementable in a computer or a microprocessor. Excepted these 
cases, the only type of control studied up to this chapter concerned simple 
feedback closed-loop with proportional, PI or PID controllers. Although this 
type of control is very important and very much used, it may be insufficient for 
all existing systems because of different problems met in real processes. 



6.1 Compensation of Time Delay 

Time delays inside a process control system may have various origins: trans- 
port of ffuids, phenomena that are slow to appear, recycle loops and dead 
times due to measurement devices. In all of these cases, classical control will 
not be satisfactory. Thus, a disturbance is detected only a long time after it 
effectively enters the system: indeed, it is necessary that the measured output 
has deviated with respect to the set point; therefore, the action to compensate 
the disturbance effect will be performed with a time delay and often will be 
inappropriate. 

The pure time delay, by adding a phase lag to the feedback loop, is an 
important source of instability for closed-loop control. The tendency will then 
be to reduce the controller gain, which will lead to a more sluggish response. 

A solution to this problem is the Smith predictor ; the development of this 
predictor lies in the knowledge of a process model (Smith, 1957, 1959). The 
model of the process transfer function Gp{s) is decomposed into two factors 

Gp{s) = G{s) exp(-fds) (6.1) 

i.e. a classical transfer function and a pure time delay. 
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The theoretical open-loop response (Fig. 6.1) to a set point variation is 
equal to 



= Km Gc{s) Ga{s) G{s) exp{-tds) Gm{s) Yr{s) ( 6 . 2 ) 

where the “tilde” notation indicates a modelled variable or function (thus in- 
cluding an error). The real response is simply deduced from the theoretical 
response by suppressing the “tilde”. 



Process 




Figure 6.1: Feedback closed-loop system in the case of a process with time 
delay 

Thus, the measured variable Ym{s) is known to the user with a time delay. 

The aim of the Smith predictor is, based on the knowledge of the time 
delay, to operate as if the process transfer function was effectively separated 
into two factors and thus to obtain the information T*(s) without time delay, 
as demonstrated in Fig. 6.2 which is completely theoretical. As a matter of 
fact, it is impossible to physically separate the time delay from the rest of the 
process. 



Process 




Figure 6.2: Theoretical obtaining of the measured output without time delay 
The theoretical measured output without time delay would be equal to 

i;;(s) = Km Gc{s) Ga{s) G{s) Gm{s) Yr{s) (6.3) 

this represents the desired output; it then “suffices” to add to Ym{s) the quantity 
Gc{s) Ga{s) G{s) [1 - exp{-ids)] G^{s) V-(s) (6-4) 



to get that result. Imagine that, in the block diagram of Fig. 6.3, the wire 
bearing the signal is cut (the loop is opened); in the real process (in open 
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loop), the following signal is obtained 

= Ga G Gm (1 — exp(— Gc Kjn Yr + Gc Ga G exp(— Gm Yr 
^GaGGmG.KmYr 

(6.5) 



Process 




Figure 6.3: Smith predictor 

The Smith predictor is thus composed of the conventional controller and a dead 
time compensator, which plays the prediction role. Internal model control can 
be mentioned concerning this controller, as the process model parameters are 
used in the predictor. 

The compensation will be perfect only if the model itself is perfect and 
the most important deals with the time delay. Thus, Schleck and Danesian 
(1978) found that when the error is larger than 30% the response is no better 
than with a classical controller. Moreover, disturbances pose serious problems 
(VanDoren, 1996). 

The realization of an analog Smith predictor is not simple and many modi- 
hed Smith controllers have been proposed to improve the original one. On the 
other hand, its discrete equivalent is easy to realize and, moreover, the latter 
can be made adaptive (Niculescu and Annaswamy, 2003). 



6.2 Inverse Response Compensation 

The phenomenon of inverse response is produced when two antagonistic effects 
occur at the core of a process. The response initially goes in one direction as a 
transfer function overcomes the other one for small periods of time, then goes 
in the opposite way towards its steady state when the second transfer function 
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overcomes the first one. Such a system (Fig. 6.4) has already been presented 
previously for two first-order transfer functions. 



Process 




Figure 6.4: Inverse response process 



It is possible with a PID controller to limit the influence of the inverse response 
owing to the feedforward effect of the derivative action. 

The best compensation that could be simply realized is based on a compen- 
sator designed in the same spirit as the Smith predictor. 

The following reasoning is performed for two first-order systems and valid 
only in that case, but the general principle remains identical. 

Let 



(Ti(5) 



Ki 

TiS + 1 



and 



G2(s) 



K2 

T2S + 1 



( 6 . 6 ) 



be the two considered first-order transfer functions representing the process 
(Fig. 6.5). The condition of inverse response (existence of a positive zero) is 



T2 A2 



The Laplace transform of the open-loop output is equal to 



(6.7) 



Y = GcGa[Gi-G2]K^Yr 

Ki{t2S + 1 ) - K2{tiS + 1 ) 



= GcG, 
= GrG. 



(tiS + 1) (t2S + 1) 
(ViT2- V2Ti)s + (Vi -V 2 ) 
(tiS + 1) (t2S + 1) 



Krr,Yr 



( 6 . 8 ) 



Krr,Yr 



The transfer function Y/Yr possesses a positive zero (thus, the inverse response 
will occur) when 



K2-K1 
Ki T2 - K 2 Ti 



(6.9) 



To eliminate the inverse response, it “suffices” to suppress this positive zero 
of the open-loop response. The quantity T'(s) is added to T(s) so that the 
desired output 



T*(s) =T'(s) +T(s) 



( 6 . 10 ) 
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does not present a positive zero anymore. It gives 



Y\s)=aGcGaK^ V 

A2 Ki 



y*(s) = Ge Ga Km [Gi - G2 + ^ - V 

K 2 Ki 

— C C K ~ ^ ~ ^ Y 

Ti5 + 1 T25 + 1 

_ ^ ^ ^ [{Ki - a)T 2 - {K 2 - g)ri] s + (Ki - K 2 ) 

" "I (ris + l)(T2S + l) 



The necessary and sufficient condition for a zero to be negative is thus 

K 1 -K 2 

{Ki - a) T2 - {K2 - a) Ti 

As Ki > 7^2, the condition on parameter a results 

K2 Ti - Ki T2 



The compensator transfer function is thus equal to 

G^comp = a Ga — — :: — <5^ 

T25+1 Ti5 + 1 



taking into account the models of the actuator denoted by a “tilde”, of the 
process and of the measurement. 



Process 




Figure 6.5: Compensation of inverse response 

The block diagram corresponding to this compensator is given by Fig. 6.5. In 
this case as for the Smith predictor, the efficiency of the compensator will be 
linked to the accuracy of the process model. 
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6.3 Cascade Control 

Control systems studied up to now have considered only one measured output 
and one control variable with a single loop. In cascade control, one control va- 
riable and more than one measurement are used. A frequent example of use of 
cascade control is the temperature control of a chemical reactor equipped with 
a jacket, e.g. for exothermic reactions (Fig. 6.6). Among the possible distur- 
bances are the feed flow rate and temperature, the thermal fluid temperature 
and, of course, the reaction enthalpy and kinetics. 

Reactants 




Figure 6.6: Thermal control of a reactor with jacket by cascade control 

In a simple feedback control, to regulate the temperature in the reactor where 
an exothermic reaction occurs (inducing a potential runaway of the reaction, 
for example), the temperature would be measured inside the reactor; this tem- 
perature would be compared to a set point value and the inlet temperature or 
the flow rate of the heating-cooling fluid would be manipulated to maintain the 
reactor temperature close to its set point by acting through a controller on a 
valve situated on the thermal fluid. 

When the action bears on the fluid flow rate or its inlet temperature, this 
action has no immediate consequence because of the heat transfer dynamics 
through the reactor wall and because of the thermal inertia of the reactor. To 
avoid this drawback, the control system can be greatly improved by realizing 
two temperature measurements: one in the reactor and the other one in the 
jacket (Fig. 6.6). The measurement in the jacket will react much faster than 
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the one in the reactor. 

The main measurement is the temperature in the reactor; this loop is called 
“master” or primary. The temperature measured in the reactor is transmitted 
by a temperature transmitter (TT) to a controller (TC), which receives an ex- 
ternal set point (provided by the operator). The cooling fluid disturbances are 
detected earlier owing to the measurement in the jacket, which acts through 
a second temperature transmitter (TT) on a second controller (TC); the con- 
troller of this loop has as a set point the output of the primary controller. For 
this reason, this loop which uses the jacket temperature is called “slave” or 
secondary. 

Thus, two temperature measurements are available, but only one manipu- 
lated variable: the thermal fluid flow rate. 




Figure 6.7: Block diagram in cascade control 

According to the block diagram (Fig. 6.7), the process is divided into two 
parts: one affected by secondary control and the other one by primary control. 
The output y of process 1 is the variable to control. The output of process 2 
influences process 1, but its control is not the fundamental point. Disturbances 
d 2 occurring at the level of the secondary loop are corrected before reaching the 
primary loop; this is not the case for disturbances di which directly influence 
the primary loop. 

A pure gain K^i has been introduced to compensate the measurement of 
the primary loop. It is not necessary to use a gain to compensate the 
measurement of the secondary loop, as it can be integrated in the secondary 
controller gain. 

The closed-loop transfer function for the secondary loop is 

Ui ^ G,2GgGp2 
Yr2 1 + Gc 2 Ga Gp2 Gm2 



(6.16) 
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Figure 6.8: Catalytic reactor presenting two cascade controls 



and the stability of the secondary loop is determined by the roots of the fol- 
lowing characteristic equation 

1 + Gq12 = 1 + Gc 2 Ga Gp2 Gm2 = 0 

thus the open-loop transfer function of the secondary loop is equal to 

Gq12 = Gc 2 Ga Gp2 Gm2 (6.18) 

In fact, the output Y 2 of the secondary loop is the input f/i of the primary 
process. The set point ¥^2 of the secondary loop is the output of the controller 
of the primary loop. Finally, the cascade system can be represented as the 
equivalent block diagram in Fig. 6.9. 

The closed-loop transfer function for the primary loop is 






Gc 2 Ga Gp2 
1 + Gc2 Gg Gp2 Gm2 
Gc2 Gg Gp2 ^ 



1 + Gc2 Gg Gp2 Gjn2 



Gpl Gml 



(6.19) 



and the stability of the primary loop is determined by the roots of the following 
characteristic equation 
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Measurement 1 



Figure 6.9: Equivalent block diagram for cascade control 



thus the open-loop transfer function of the primary loop is equal to 

Gc2 Ga Gp2 



Goii — Gel 



1 + Gc2 Ga Gp2 Gm2 



Gr)l G 



pi '^ral 



( 6 . 21 ) 



In general, cascade control presents better stability characteristics than a 
simple feedback control; moreover, it is less sensitive to modelling errors. 

The secondary process presents a smaller time constant than the primary 
process, so that frequently the dynamics of the secondary process is neglected 
and it is simply represented by a pure gain (either Gp 2 or Gd 2 )- The only 
remaining dynamics in the process is then present in Gpi and Gdi- 

Cascade control is realized by two conventional controllers, but the control 
of the secondary loop does not need to be realized as well as that of the primary 
loop. For the secondary loop, a proportional controller can be used (which will 
produce a steady-state deviation compensated for in the primary loop), or a PI 
controller which will suppress this deviation. The secondary loop reacts much 
faster than the primary loop. Thus its phase lag is smaller and the critical 
frequency cjc, if if exists, is larger. Larger gains can thus be used. 

The controllers tuning is done according to the following procedure: 

— First, tune the secondary controller by a classical method (e.g. Cohen-Coon 
or Ziegler-Nichols); make sure whether a P or PI controller is necessary. 

— From the Bode plot of the global system, determine the critical crossover 
frequency uJc by using the previous tuning of the secondary controller. Then, 
from the frequency response techniques, tune the primary controller. 

In nearly all chemical processes, flow rate control is realized by cascade (e.g. 
heat exchanger, distillation column, absorption column) 

Cascade control is frequently used in industrial processes. Figure 6.8 re- 
presents a liquid phase catalytic reactor. In one step, the pressure above the 
liquid phase is controlled by a cascade control by measuring as a secondary 
variable the feed flow rate. In the other step, the temperature in the reactor 
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is controlled by a second cascade control by measuring as a secondary variable 
the fluid temperature exiting the heat exchanger. 

Example 6.1: Cascade Control 

The simulated process is decomposed into two dynamic parts presenting time 





Figure 6.10: Process response in a control without cascade (top figure) or with 
cascade (bottom figure) to a unit step set point (at t = 0), a step disturbance 
of amplitude 0.1 on the secondary loop (at t = 20) and a unit step disturbance 
on the primary loop (at t = 40) 



constants of different orders of magnitude, small for the secondary process 
and larger for the primary process. Moreover, the primary process presents a 
time delay. Two disturbances affect the process: di inside the primary loop 
and d 2 inside the secondary loop. The influence of the cascade is studied 
by comparing the response with and without the cascade. The tuning of the 
primary controller is realized in both cases according to Ziegler-Nichols, Table 
4.2. The data are as follows 



Gp2 



10 

5 + 1 



7 Gd2 



10 

5+1 



, Cm2 = 0.25 , G, = 5 




Process Control 



249 



30 exp(— 0.5 s) 
105+1 



, Gdi 



4 exp(— 0.5 s) 
105 + 1 



Gml — 0-1 



where the time unit is minutes. 

In the case without the cascade, Ziegler-Nichols tuning provides a PID 
controller that has the following characteristics 



Kc = 0.072 , T/ = 2.25min , td = 0.56min (6.22) 

In the case with the cascade, a proportional controller is chosen for the 
secondary loop. The gain can be freely chosen, as the secondary loop presents 
no crossover frequency. Nevertheless, as the steady-state gain of the open-loop 
transfer function of the secondary loop is equal to 12.5, a controller proportional 
gain equal to 1 is considered sufficient. For the primary loop, a PID controller is 
chosen and tuned by a Ziegler-Nichols, which gives the following characteristics 



Kc = 1.546 , T/ = 1.12min , td = 0.28min (6.23) 

This system is successively subjected to: 

• At t = 0, a unit step set point, 

• At t = 20 min, a step disturbance di of amplitude 0.1, 

• At t = 40 min, a unit step disturbance ^ 2 - 

Figure 6.10 shows that the presence of the cascade is visible as expected, as the 
influence of disturbance d 2 affecting the secondary loop is strongly decreased, 
the influence of disturbance d\ is also reduced, the dynamics of the response 
to a set point variation is faster and the overshoots are less pronounced. The 
cascade dampens the disturbances and provides a better response to a set point 
variation. 



6.4 Selective Control 

In a few cases, the process presents more controlled variables than manipulated 
variables. In principle, the number of manipulated variables should be larger 
or equal to the number of outputs. It becomes clear that it will be necessary 
to select the outputs that will realize the control. This is performed by a set 
of hierarchical rules imposed by the user. 

Example 6.2: Protection of a Boiler 

Consider a boiler heated by a coil fed by vapour. If the coil is dry, even par- 
tially, resulting hot spots can lead to coil destruction. In the case of an electrical 
resistance, the problem is identical. Thus, there are two actions that need to 
be realized: the control of produced vapour pressure and the safety action, i.e. 
the level control in the boiler. A special system allowing passage from one 
measurement to another will be necessary. 
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Example 6.3: Protection of a Tubular Catalytic Reactor 

The protection of a tubular catalytic reactor against the appearance of hot 
spots that can cause deterioration of the catalyst qualities necessitates mea- 
surement of the temperature at numerous places along the catalyst bed. On 
the basis of these measurements, a selector can choose the highest value and 
base the control on this. Moreover, the redundancy of measurements allows 
greater confidence in the obtained outputs. This problem is common for fiu- 
idized catalytic crackers (FCC) in refineries. 



Example 6.4: Control of a Distillation Column Reboiler 

The control variable is the heat content brought to the reboiler. There exists a 
lower limit on this heat quantity: below this limit, there is not enough vapour 
produced and the liquid does not remain on the trays; there exists a higher 
limit: above this limit, fiooding occurs in the column. 



6.5 Split-Range Control 

In this case, there exists only one measurement and several manipulated va- 
riables. The following example shows the interest in this type of control with 
regard to safety. As for selective control, a set of rules is provided by the user 
to define the strategy to be implemented. 

Example 6.5: Pressure Control of a Chemical Gas Phase Reactor 

The reactor comprises only one measurement: pressure in the reactor. On the 
other hand, two valves must be actuated: one situated before the reactor, the 
other one after it. The actions on both valves must then be coordinated (Fig. 
6 . 11 ). 




Figure 6.11: Split-range control 




Process Control 



251 



If the pressure is too high, the upstream valve is progressively closed accord- 
ing to a linear function of the deviation between the effective pressure and the 
desired pressure Pq, and the downstream valve is progressively opened. If the 
pressure is too low, the downstream valve is partially closed and the upstream 
valve opened. Below some fixed pressure Pi, the downstream valve is com- 
pletely closed, and above a pressure P 2 , the upstream valve is closed. 



6.6 Feedforward Control 

6.6.1 Generalities 

Feedback control can never be perfect, as the manipulated variable is modified 
only when the infiuence of a disturbance is detected. 

Feedforward control is a solution to this problem; the idea is to measure 
the disturbances and to take a decision before their effect is felt on the process 
(Fig. 6.12). 




Figure 6.12: Feedforward control 

Example 6.6: Temperature and Concentration Control of a Perfectly 
Stirred Continuous Reactor 

Suppose that the only two disturbances are the concentration and the tempera- 
ture of the feed stream. Both manipulated variables are the product withdrawal 
fiow rate and the thermal fiuid fiow rate. 

In classical feedback control, the temperature and concentration would have 
been measured in the reactor or in the outlet stream and the action would have 
been taken through both manipulated variables. 

In feedforward control, the temperature and concentration of the inlet 
stream which constitute the disturbances are measured, and a model of the 
process is used to deduce the action on the same manipulated variables as pre- 
viously. 



Feedforward control suffers from several drawbacks: 

• Disturbances must be measured on-line; this is not always possible. 
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• A model is necessary to know the manner according to which the controlled 
variables depend on the manipulated variables and on disturbances. 

• An ideal feedforward controller, i.e. one that is able to maintain the process 
output to the set point value whatever the set point variations and the 
disturbances, is not always physically realizable. 

6.6.2 Application in Distillation 

Example 6.7: Feedforward Control for a Distillation Column 

In a distillation column, frequently the feed is not controlled with respect to its 




Figure 6.13: Realization of a feedforward control to control the feed flow rate 
disturbances 

flow rate F and its composition Zi?, so that, regarding control, these variables 
are considered as disturbances. Figure 6.13 shows how flow rate variations 
can be taken into account in a feedforward control (Ryskamp, 1987). The 
measurement of top pressure is used in cascade control: the output of the 
pressure controller PC is sent as a set point to the reflux (L) flow rate controller 
FC. The output of the top composition controller AC provides an estimation 
of the ratio D/F, which is multiplied by the measured flow rate F and, taking 
into account the time delay, gives the set point of the distillate D flow rate to 
the controller FC of the distillate flow rate. 

In the proposed configuration, only the flow rate is measured, while the 
disturbances of feed composition are not measured. It would be possible to 
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I 



build an additional feedforward control to take them into account. 



6.6.3 Synthesis of a Feedforward Controller 

A model of the process is necessary; the more accurate this model, the better 
the action of the feedforward controller. 



Feedforward controller 




L J 



Process 

Figure 6.14: Block diagram of feedforward control 

The process model is described by two transfer functions: one for the manipu- 
lated variable, the other one for the disturbance 

y(s) = G„(s) C/(s) + Gd(s) D{s) (6.24) 

To obtain perfect tracking, the following condition must be fulhlled 

y(t) = yr(t) Y (s) = V(s) (6.25) 

SO that ideally the following can be written 

i;(s) = U{s) + D{s) (6.26) 

from which the input U (s) is deduced, which takes into account the changes of 
the set point t/r and of the disturbance d 

To constitute a feedforward controller, the control system must include both 
terms of the set point and the disturbance and must also comprise two blocks. 
Thus, in the block diagram in Figure 6.14 (leave aside the measurement and ac- 
tuator blocks) the feedforward control system includes, besides the comparator, 
a hrst block representing the main transfer function (related to the disturbance) 
of the feedforward controller equal to 

U{s) _ Gdjs) 

D{s) Gu{s) 



(6.28) 
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and a second block, the transfer function of which related to the set point is 
equal to Gc{s) such as 



U{s) 1 
V-(s) G„(s) 



(6.29) 



From the block diagram, the control law integrating the feedforward controller 
is now written as 



U{s) = Gc{s) Gr{s) Ga{s) V-(«) “ Gmd{s) Gr{s) G,(s) D{s) (6.30) 

so that by identifying Eqs. (6.27) and (6.30), the following relations are ob- 
tained 



1 

Gu{s) 


— Gc{s) Gr{s) Ga{s) , r> ( \ ~ ^rnd{s) Gr{s) Ga{s) 


(6.31) 


giving 


C („\ - Gmd{s) 


(6.32) 


and 


G (") - *^‘**'®^ 

^ G„rf(s)G,(s)G„(s) 


(6.33) 


Checking: 







Taking into account the disturbance measurement and the actuator, from the 
block diagram thus constituted, it can be written 



Y (s) — [Gd{s) — Gmd{s) Gr{s) Ga{s) Gu{s) ] D{s)-\-Gc{s) Gr{s) Ga{s) Gu{s) Yr{s) 

(6.34) 

1) So that the disturbance influence is zero (disturbance rejection), the factor 
of D{s) must be zero, thus 



which results in 



Gr{s) 



Gd{s) 

Gmd{s) Ga{s) Gu{s) 



Y{s) 



Gc{s) Gr{s) Ga{s) Gu{s) Tr('§) 
Gc(s) Gdjs) 

Gmd{s) ^ 



(6.35) 



(6.36) 



2) A perfect set point tracking is desired, thus T(s) = T^(s), giving 



Gc{s) Gd{s) — Grrid{s) (6.37) 

hence 

Notice that the transfer functions thus calculated of the feedforward controller 
Gr{s) and Gc{s) depend on the process model through the reciprocals of the 
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transfer functions Gd{s) and Gu{s), which constitutes a main drawback of 
this control. To be physically realizable, a transfer function must have its 
numerator degree lower or equal to that of the denominator. Thus, when the 
theoretical transfer functions Gr{s) and Gc{s) do not fulhl this condition, it 
will be necessary to hlter independently each of them by a hlter of the form 



1 

(r 5 + 1)^ 



(6.39) 



Moreover, we must consider possible time delays or positive zeros present in the 
transfer functions intervening in the denominators of the expressions of Gc{s) 
and Gr{s), as they would correspond respectively to advances or unstable poles 
for GJs) and Gr(s). Consequently they must be eliminated in the calculation 
ofGcls) and Gr{s). 



6.6.4 Realization of a Feedforward Controller 

This type of controller is, in fact, difficult to implement by an analog design; 
on the contrary, numerically, it is implementable on a microprocessor or a 
computer. 

In the following discussion, to simplify, we consider that the transfer func- 
tions of the measurement and the actuator have no dynamics and thus are pure 
gains Gm = and Ga = If it were not the case, the following discussion 
should take the dynamics into account, but the reasoning would be similar. 

Both transfer functions characteristic of the feedforward controller now de- 
pend, from a dynamic point of view, only on Gd and Gu- For these two transfer 
functions, two parts are distinguished: a steady-state element corresponding to 
the steady- state gain and a dynamic element, giving the notation 

Gd{s)=KdG'M and Gu{s) = G'^{s) (6.40) 



so that the steady-state gains of the dynamic parts G'^{s) and G[^{s) are equal 
to 1. 

The feedforward controller itself will present two components 

— A steady- state component. 

— A dynamic component. 



Calculation of the steady-state part: 

It suffices to take for each transfer function its steady- state component, which 
gives the steady-state elements of the feedforward controller according to pre- 
vious equations 




and 



Kc 






(6.42) 



Calculation of the dynamic part: 
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Rather than use the true transfer functions, first-order approximations are 
taken for G'^{s) and thus 






1 

Ti S + 1 



and G^(s) = 



1 

T2 S + 1 



(6.43) 



The dynamic elements of the feedforward controller are then 



r' isi = = T 2 S + I 

’ G'^{s) n S + 1 



(6.44) 



and 

G'c{s) = = Ti s + 1 (6.45) 

The dynamic part G'^{s) of the transfer function of the feedforward controller is 
called a lead-lag element, as the numerator T 2 5 + 1 introduces a phase advance, 
while the denominator ti s 1 introduces a phase lag. According to the value 
of parameters ri and T 2 , the lead-lag element can be either an advance or a 
lag. The theoretical transfer function G'^ is not physically realizable, so that it 
is necessary to filter it by at least a first-order filter of unit gain. 



Physical Realizability and Pure Time Delays 

Suppose that both transfer functions G'^ and G'^ contain respective time delays 
td and tu and that the rest of the transfer function is constituted by the same 
first-order 






exp{-td s) 
r 5 + 1 



and G'^{s) 



exp{-tu s) 
r 5 + 1 



(6.46) 



In this case, it would result in 



G' 

G'r = = exp((t„ - td) s) 



(6.47) 



which means that if tu is larger than td^ then this is an advance term! Thus, 
future values of the disturbance would have to be known to calculate the value 
of the input. This is, of course, impossible: the ideal feedforward controller is 
physically unrealizable. A physically realizable controller can be constituted 
by an approximation of the theoretical controller. 



6.6.5 Feedforward and Feedback Control 

As the requirements of feedforward control are very important, it seems inte- 
resting to add to it a feedback control which will correct its imperfections (Fig. 
6.15): 

• Non identified or ill-known disturbances. 

• Variations in the operating conditions of the process. 

• Model of the incomplete process. 
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Feedforward controller 




Figure 6.15: Block diagram of feedforward and feedback control 



However, the philosophy is that the feedforward controller should do 90% of 
the job while the feedback would correct the remaining 10%. 

According to the block diagram (Fig. 6.15), the expression of the system output 
Y with respect to inputs Yr and D from the system equations 

Y = GuU^GdD (6.48) 

and 

U = Ga (Ual + Ua2) = (^rl + ^r2 ^ 2 ) (6.49) 

with 

E2 = GcYr-GmdD (6.50) 

and 

E,=Yr-Ym = Yr- Gmi Y (6.51) 

thus 



Y = GuGa Grl {Yr ~ Gml Y) + Ga Gr2 (^c " Gmd D)^GdD 

hence 

^ _ Gu Ga {Grl + Gr2 G c) y ^ Gd ~ Gu Ga Gr2 Gmd jj 



1 + Gu Ga Grl Gjni 



1 + Gu Ga Grl Gjni 



(6.52) 

(6.53) 



The stability analysis realized for feedback systems is not modihed by the 
addition of a feedforward loop. Actually, the characteristic equation: 

1 + Gu Ga Grl Gml = 0 depends only on the feedback elements. 

To hnd the characteristics of the feedforward system, the reasoning is based 
on the fact that the control work must be realized by the feedforward action. 
A perfect disturbance rejection is demanded, thus 



^d — ^r2 ^md 



(6.54) 
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hence 



G 



r2 — 



Gd 



Gu Ga Gmd 



(6.55) 



To find the transfer function linked to the set point, the set point can be 
separated into two terms: one acting on the feedback and the other one 
Yr 2 acting on the feedforward. Thus, we can consider = 0 and D = 0, 

This results in 



Gu Ga Gr2 Gc — 1 



(6.56) 



hence 



Gr — 



Gmd 

Gd 



(6.57) 



If the feedforward action were not perfect, a deviation 62 7 ^ 0 would occur, and 
the feedback would partially compensate the deviation. 



6.7 Ratio Control 

In ratio control, the objective is to maintain two measured variables at a con- 
stant ratio. This control is most often applied in the case of fiow rates ratios. 
The remarkable characteristic is that only one of the fiow rates is manipulated, 
denoted by m; the other fiow rate is called “wild” and corresponds to a distur- 
bance d. A typical application is the control of the refiux rate of a distillation 
column: the ratio of the refiux fiow rate to the distillate fiow rate must be 
maintained constant. 

The desired ratio R is thus equal to 



777 

i?=— (6.58) 

This control can be realized in one of two ways: 

• Either both fiow rates (“wild” and manipulated) are measured and their 
calculated ratio is compared to a desired ratio; the deviation between the 
ratios is used as the input of the ratio controller to correct the manipulated 
stream. This method implies the use of a divider (nonlinear). The process 
open-loop gain in this case is equal to 



K = 



dR \ _ 1 
dmJd ~ d 



(6.59) 



and thus is the inverse of the disturbance (“wild” stream). 

• Either the “wild” fiow rate is measured and multiplied by the desired ratio, 
then compared to the fiow rate that the other manipulated stream should 
have. In this method, the process open-loop gain remains constant. 

In ratio control, it appears that in both cases the disturbance must be measured. 
Thus, it can be considered as a particular form of feedforward control. 

Example 6.8: Ratio Control for a Distillation Column 

Eor a distillation column, Ryskamp (1980) proposed to manipulate the ratio 
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DjVn to control the top composition. Because of the mass balance = L + L), 
\i D jVn is hxed, LjD and LjVn are also hxed. The system described by Fig. 
6.16 responds to heat power variations at the reboiler in the following manner: 




Figure 6 . 16 : Ratio control at the top of a distillation column 

increasing the heat power induces a pressure increase, which is counterbalan- 
ced by an increase of its output, i.e. the total top vapour Vn- This signal is 
on one hand multiplied by the output D jVn of the composition controller to 
provide the set point of distillate D flow rate. On the other hand, it enters 
the comparator, where the measured flow rate D is subtracted from it to give 
the set point of the reflux L flow rate. The pressure controller thus increases 
L and D according to a ratio set by the output of the composition controller. 
Thus, the reflux rate varies without a time delay at the request of composition 
controller. 
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Chapter 7 

State Representation, 
Controllability and 
Observability 



In this chapter, only notions relative to state-space linear systems are studied. 
The fundamental concepts of controllability and observability are described in 
an analogous manner in continuous and discrete time; they even can be dehned 
independently of the state-space representation. 



7.1 State Representation 

7.1.1 Monovariable System 

The dynamic behaviour of a monovariable system (single-input single-output) 
is expressed by an ordinary differential equation of order n (equivalent to a set 
of n first-order ordinary differential equations) or a partial differential equation; 
for a multivariable system, it is transformed into a set of ordinary differential 
equations. Using a numerical scheme of discretization of ordinary derivatives, 
it is possible to represent any system under a discrete form in the state space. 

A continuous monovariable system is described by the following state-space 
representation 

x{t) =Ax{t)^Bu{t) . . 

y(t) = Cx(t) + Du(t) ^ ^ 

X is the state vector of dimension n. A{n x n) is the state matrix (or evolution 
matrix), B{nxl) the input matrix (or control matrix) and C(1 x n) the output 
matrix (or observation matrix). D is Si scalar in the monovariable case. 

The set of differential equations 



x{t) = Ax{t) + Bu{t) t >0 



( 7 . 2 ) 
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can be integrated between 0 and t when the matrix A has constant coefficients, 
and its solution ^c(O), u) is 



x{t^x{0)^u) = exp{At)x{0) [ e-xp{A{t — T))Bu{r)dT C^-3) 

Jo 

When the coefficients of the matrix A are not constant, the solution corres- 
ponding to an integration between two instants t and to is represented in the 
form 

x{t,x{to),u) = ^{t,to)x{to) ^ / ^{t,r)Bu{r)dr (7.4) 

Jto 

where is called the state transition matrix corresponding to the inte- 

gration of the homogeneous equation 

x{t) = A{t)x{t) C^-5) 

The monovariable system can be represented by the following continuous 
transfer function 

G{s) = C{sl - A)-^B + D (7.6) 

The state representation (A, B^ C, D) constitutes a realization of G{s) and is 
not unique. It suffices to apply an invertible transformation matrix T on this 
realization to find a different state representation 

A' = T~^AT , B' = T~^B , C' = CT (7.7) 



To the continuous monovariable system, corresponds, by time discretization, 
the following discrete system 



x{k + 1) = Adx{k) + Bdu{k) 

y{k) =Cx{k)ADu{k) 



where the instant tk is simply indicated by integer variable k. 

When sampling is performed with a period Tg and a zero-order holder, it 
results that 

Ad = exp(ATs) 

Bd = [ exp{At)Bdt ^ ^ 

Jo 

The monovariable system (7.8) can be represented by the following discrete 
transfer function 

H{z) = C{zl - Ad)-^Bd + D (7.10) 

To proceed from the continuous transfer function G{s) of Eq. (7.6) to the 
discrete transfer function of Eq. (7.10), it thus suffices to consider 5 or z as 
operators and to replace A and B by Ad and Bd- 
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7.1.2 Multivariable System 

The multivariable (multi-input multi-output) state representation is very close 
to the single-input single-output state representation. The dynamic behaviour 
of a monovariable system is expressed by an ordinary differential equation of 
order n or partial differential equation; for a multivariable system, it is a set of 
differential equations. A continuous multivariable system can be represented 
by the model 

x{t) =Ax{t)^Bu{t) m 

y{t) =Cx{t)ADu{t) ’ 

If the state vector x(t) has dimension n, A state matrix (or evolution matrix) 
has dimension (n x n), B input matrix (or control matrix) has dimension 
{n X Uu) C output matrix (or observation matrix) has dimension (riy x n), 
while D is the coupling matrix (or direct transmission matrix) of dimension 
(% X n^). 

These equations can be transformed to obtain the system with respect to 
the Laplace variable 

sX{s) = AX{s) ABU{s) 

Y{s) = CxIs)aDu\s) 

The Laplace transform of the state vector is 

X(s) = [si - A]-'- BU{s) 

and the Laplace transform of the output is equal to 

Y{s) =C[sI - A]-^ BU{s) + DU{s) (7.14) 

The multivariable system (7.11) can be represented by the matrix of continuous 
transfer functions of the form 



(7.12) 

(7.13) 



G(s) = C{sl - A)-^B + D 



(7.15) 



For sufficiently large |s|, G{s) can be expressed as a Markov series expansion 



CB CAB CA^B 

G{s) = D + + — ^ ^ — + . , 



(7.16) 



where the factors CA^~^B are Markov parameters. 

By using a numerical scheme of discretization of ordinary derivatives, any 
system can be represented in a discrete form in the state space. The discrete 
state-space model is 



x{t + 1) = Adx{t) + Bdu{t) 

y(t) = Cx(t) + Du(t) 



The state vector x(t) has dimension n. Ad has dimension (nxn), Bd dimension 
{n X flu), C dimension {uy x n) and D dimension {uy x Uu)- 

This multivariable system can be represented by the matrix of discrete 
transfer functions in the form 



H{z)=C{zI-Adr^Bd + D 



(7.18) 
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7.2 Controllability 

The reasoning concerning controllability, observability and realizations, is iden- 
tical for a continuous- time and a discrete- time system. Although examples will 
be chosen for single-input single-output systems, controllability and observabi- 
lity concepts can be easily generalized for multivariable systems. 

Equation (7.17) shows that to generate outputs yj{k) from inputs Ui{k)^ 
the states x{k), which themselves depend on inputs Ui(k)^ are necessary in- 
termediates, thus these states must be controllable. Moreover, it is important 
that the states x(k) can be reconstructed from the measurements constituted 
by inputs Ui(k) and outputs yj{k), thus the states must be observable. 

A system is said to be controllable if it is possible to find an input u that 
allows it to go from a state x{l) to a state x(2) in a finite time. Often, the 
state x{l) is the initial state and time ti the initial instant. 

This condition is not necessarily realized. Consider a monovariable system: 
it is sufficient that one or several states is not influenced by the input. 

Some authors such as Wohnam (1985) distinguish reachability from con- 
trollability. According to Wohnam (1985), a state x is reachable from a state 
a:^(0) if there exists a time t {0 < t < oo) and an input u (in the domain of 
acceptable inputs) such that the solution x{t,x{0),u) of the linear ordinary 
differential Eq. (7.2) is equal to this given x. Let A be the space of reachable 
states from the state a:^(0) and A the space of all states; M is a subspace of A. 
The linear system described by the pair (A, B) is controllable if the subspace 
A is the entire space A (their dimension is the same). In the following, only 
controllability properties will be studied. 

In the case of a discrete monovariable system, Eq. (7.8) allows us to write 
at successive instants, using recurrent relations 

x{l) = Ax{0) -\- Bu{0) 

x{2) =Ax\l) + Bu\l) = A^x{0) + [ABu{0) + Bu{l)\ 

a;(3) = Ax{i) + Bu{i) = A^a;(0) + [A^ Bu{Q) + A Bu{l) + Bu{2)] 



x{k) = Ax{k — 1) + Bu{k — 1) 

= A’^xiQ) + [A’^-^ Bu{Q) + Buil) + . . . + Bu(k - 1)] 

(7.19) 

The state vector, being of dimension n, for k = n, Eq. (7.19) is written as 



x{n) - A^x(0) = [B AB ... A^~^ B B] 



u{n — 1) 
ix(n — 2) 



U{1) 

[ «(0) J 

(7.20) 

Representing by Un-i the vector composed by successive inputs, and starting 
from the most recent inputs 



u^_i = [u{n — 1) u{n — 2) ... ix(l) u{0)] 



(7.21) 
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the necessary and sufficient condition for the vector of inputs Un-i to be cal- 
culable is that the controllability matrix 

C = [B AB A^-^B A^-^B] (7.22) 

has rank n. The pair (A, B) is said to be controllable. 

At any instant k > n, the rank of matrix 

Ck = [B AB A'^-^B A'^-^B] (7.23) 

would have been the same as that of matrix C. 

The necessary and sufficient condition of controllability of a system is thus 
that the controllability matrix C has full rank and is equal to n. 

The rank condition, which is mathematically strict, will only be approached 
at the numerical level 

The system described by the linear difference equation 
y{k) + ai y{k — 1) + . . . + y{k — n) = bi u{k — 1) + . . . + 5 ^ u{k — n) (7.24) 
corresponding to the discrete transfer function 




(7.25) 



(7.26) 



Cc=[hn hn-i ■■■ &2 h ] (7.27) 

where the matrices (Ac, Be, Cc) are expressed in the controllable companion 
form. 

The pair (Ac, Be) such defined is always controllable; to prove it, the cor- 
responding controllability matrix can be shown to be of rank n. The matrix 
Ac is a companion form of the denominator of the transfer function. 

Note that there exist four canonical controllable forms (Borne et ah, 1992; 
Foulard et ah, 1987) that are possible for a monovariable system, depending on 
whether the sequence {ai ... is situated on the first or the last row, the 
first or the last column of matrix Ac. For this reason, the resulting matrices 
Be and Ce differ. Note that the matrix A from the canonical controllable 

^The rank of a matrix A can be numerically calculated as the number of singular values 
of A that are larger than a given scalar e > 0. 
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or observable forms is often ill-conditioned ^ and thus will make numerical 
calculation difficult and inaccurate. 

Consider {Aq, Bq^Co) as a given controllable form. Let P be a transfor- 
mation matrix such that A = P Aq P~^, B = P Bq and C = Co P~^ . It can 
be shown that (A, P, C) constitutes a new controllable form. Thus, again the 
non-uniqueness of the state-space representation of a given system is shown. 

If a state-space system is considered in any form (A, P, C), the polynomial 
characteristic of matrix A equal to 

det^sL — A) = T ci\ ^ T • • • T ctn—i ^ T (7.28) 

depends only on the coefficients of the last line of matrix Ac of its controllable 
canonical form. 

The state-space system can be represented by using the eigenvalues of ma- 
trix A or modes of the system. The form thus obtained is called the modal 
canonical form or Jordan form which will be denoted by (A^c, Cmc)- 
• When the eigenvalues of matrix A are real, they constitute the diagonal 

elements of matrix A^c, oi 



Ai 0 



A 



me 



0 




(7.29) 



• When the eigenvalues of matrix A are complex, they constitute diagonal 
blocks of matrix A^c- Let: Ai = cr + jeo be a complex eigenvalue, its 
conjugate is also a complex eigenvalue. The diagonal block is 



a uj 

—u a 



(7.30) 



and is thus formed by the real parts of the complex eigenvalues on the 
diagonal, and the complex parts on each side of the diagonal. 

• When Ai is a multiple eigenvalue of order j, there corresponds in matrix 
Arac a square block of dimension j of the form 



A 1 0 .. 

0 A 0 

: 1 

0 ... 0 A 



(7.31) 



The matrix A^c corresponding to the modal canonical form is well-conditioned 
and it is recommended to use it rather than a companion form. 

^The condition number of a matrix is the ratio of the largest singular value to the smallest 
singular value of this matrix. A matrix is well-conditioned and thus easily invertible when 
this value is around 1. When the condition number becomes larger than around 1000 or 
more, the matrix is increasingly ill-conditioned. 
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The matrix Bmc is only formed of 1, except for the j — I hrst lines corres- 
ponding to a multiple pole of order j where these are 0. 

The system transfer function corresponding to the modal canonical form 
corresponds to a decomposition as a sum of rational fractions where simple, 
multiple and conjugate complex poles are demonstrated. The coefficients of 
rational fractions are the elements of matrix 

Example 7.1: An Uncontrollable System 

Consider the example in Figure 7.1. The continuous state-space model of this 




Figure 7.1: Uncontrollable system. System block diagram corresponding to 
Eq. (7.32) 



system is given by 



" -4 


0 


0 




■ 2 ■ 


0 


-1 


0 


; B = 


1 


1 


1 


-2 




0 



The controllability and observability matrices are respectively 



" 2 


-8 


32 




0 


0 


1 


1 


-1 


1 


; 0 = 


1 


1 


-2 


0 


3 


-15 




-6 


-3 


4 



(7.32) 



(7.33) 



The controllability matrix C has rank 2 and the observability matrix O has 
rank 3. Thus, one of the states of this system is not controllable. 

The transfer function corresponding to this system is equal to 



G{s) 



3 

{s + 4)(s + 1) 



(7.34) 



It is clear that the pole s = —2 has been cancelled. 

The modal canonical form of this system is obtained by a basis change using 
a transformation matrix P. The modal canonical form thus obtained is 



" -2 


0 


0 




0 


0 


-4 


0 




2,2361 


0 


0 


-1 




1,4142 



;C'„c=[l -0,4472 0, 7071 ] 

(7.35) 
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while the transformation matrix is 



P = 



0.5 -1 1 

1.118 0 0 
0 1.4142 0 



(7.36) 



The block representation of this system (Fig. 7.2) shows that the state xi 
corresponding to the eigenvalue —2 is not controllable. 




Figure 7.2: Uncontrollable system. Block diagram corresponding to the modal 
canonical form (Eq. (7.35)) 



It can be noticed that the matrix A already had its eigenvalues on the diagonal, 
as it was lower triangular. 



7.3 Observability 



A system is said to be observable if, in an interval [UU 2 ], with the input as- 
sumed to be known a priori, the knowledge of the output allows us to determine 
the initial state x{ti) of the system. It suffices that a state does not influence 
the output for this system not to be observable. 

Consider the monovariable discrete system (7.8). The output at time k is 
equal to 



y{k) = CA’^xiO) + CA’^-'-Bu(0) + CA’^~^Bu{1) + . . . + CBu{k - 1) + Du{k) 

(7.37) 

This recurrent relation can be used to make the state vector appear at the 
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initial instant with respect to all past inputs and outputs according to 



’ 2/(0) ' 




■ C ■ 


2/(1) 




CA 


2/(2) 


= 


CA^ 


. y{k) _ 




_ ca'^ _ 



cc(0) + 



D 

CB 

CAB 

I k — 1 






0 

D 

CB 

k-2 1 



0 

D 



... 0 

... 0 

0 0 

CB D 





w(0) 




u(l) 




u{2) 




u{k — 1) 



(7.38) 

To calculate the initial state a:^(0) of dimension n with respect to past inputs 
and outputs, the matrix Ok 



Ok = 



c 

CA 

ca’^ 



(7.39) 



should have rank n. If A; > n — 1, the rank of matrix Ok is at maximum n. 

The necessary and sufficient condition of observability is thus that the ob- 
servability matrix O 



O = 



C 

CA 

CA^-^ 



(7.40) 



has full rank, equal to n. The pair (A, C) is said to be observable. 

Again consider the system described by the difference linear Eq. (7.24). 
The system can be represented by the following observable canonical form 



0 0 —an 




bfi 


1 : 






0 : 


Bo = 




: 0 : 






0 ... 0 1 —ai 




_ 



Co = [ 0 0 1 ■ 



(7.41) 



(7.42) 



where the matrices (Ao,-Bo,Co) are expressed in the observable companion 
form. 
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The pair (A^, Cq) thus defined is always observable; to prove it, it suffices 
to show that the corresponding observability matrix has rank n. The previous 
matrix Ao is also a companion form of the denominator of the transfer function. 

There exist also four possible canonical observable forms (Borne et ah, 1992; 
Foulard et ah, 1987), for a monovariable system. 

In a similar manner to the controllability study, if a state-space system 
is considered under any form (A, S, C), the characteristic polynomial of the 
matrix A equal to 

det(5F — A) = T ci\ T • • • T ctfi—i s T ciji (7.43) 

depends only on the coefficients of the last column of the matrix Aq of its 
observable canonical form. 

Controllability and observability are dual notions. It is possible, for exam- 
ple, to go from the controllability matrix to the observability one by considering 
a second system where A and B are respectively replaced by A^ and . 

Example 7.2: An Unobservable System 

Consider the example in Figure 7.3. Notice the similarity with the example used 




Figure 7.3: Unobservable system. Block diagram of the system corresponding 
to Eq. (7.44) 



to illustrate the loss of controllability. The state-space model of this system is 
given by 



" -2 


0 


0 




■ 1 ■ 


2 


-4 


0 


; B = 


0 


1 


0 


-1 




0 



The controllability and observability matrices are, respectively 



" 1 


-2 


4 




0 


1 


1 


0 


2 


-12 


; 0 = 


3 


-4 


-1 


0 


1 


-3 




-15 


16 


1 



(7.44) 



(7.45) 



The controllability matrix C has rank 3 and the observability matrix O has 
rank 2. One of the states of this system is thus not observable. 

The transfer function corresponding to this system is the same as in the 
previous example and is equal to 



G{s) — 



{s + 4)(s + 1) 



(7.46) 
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The pole s = —2 has been cancelled. The modal canonical form is 



" -1 


0 


0 




1 


0 


-4 


0 


5 ^mc — 


-1 


0 


0 


-2 




1.7321 



A — 

while the transformation matrix is 

P = 



5 me 



1 0 1 
-1 1 0 
1.7321 0 0 



= [1 1 0 ] 

(7.47) 

(7.48) 



The representation as a block diagram of this system (Fig. 7.4) shows that the 
state xs is not observable. 




Figure 7.4: Unobservable system. Block diagram corresponding to the modal 
canonical form (Eq. (7.47)) 



7.4 Realizations 

Any discrete system, described by the matrices A, C, can also be represented 
by the discrete transfer function 

H{z) = C{zI - A)-^B (7.49) 

The corresponding continuous-time model and transfer function could be con- 
sidered as well. 

It is only possible by a change of base to go from the controllability canonical 
form to the observability canonical form when the numerator and denominator 
of the system transfer function have no common root: the system is non degen- 
erated. The set of the eigenvalues of A and the set of the poles of the transfer 
function are then identical. In this case, the system is controllable and observ- 
able. The state equation is then a minimal equation and constitutes a minimal 
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realization of the transfer function. Nonminimal realizations use a state vector 
of larger dimension than necessary, given the system transfer function. If a pole 
and a zero can cancel themselves, the concerned pole is either not controllable, 
or not observable. 

In the most general case, a system can be decomposed into an observable 
and controllable part, an observable uncontrollable part, a controllable unob- 
servable part and an unobservable and uncontrollable part. When unobservable 
or uncontrollable parts exist, the transfer function H{z) can be simplified. 

The study of controllability and of observability is, in fact, frequently de- 
licate by use of controllability and observability matrices. The controllability 
and observability grammians possess better numerical properties and, more- 
over, their physical meaning is clearer. For a continuous-time system with 
constant coefficients, the controllability grammian^ is defined by 

pOO 

Gc= exp(Ar)SS^ exp(A^r)dr (7.50) 

Jo 

and the observability grammian by 

pOO 

Go = / exp(Ar)CC^ exp(A^r)dr 

Jo 

Practically, the system is studied for a finite time [0, T] and the transient gram- 
mians are the integrals taken between the boundaries 0 and T, e.g. for the 
transient controllability grammian 

pT 

Gc= exp(Ar)SS^ exp(A^r)dr (7.52) 

Jo 

which has as a limit the controllability grammian when T ^ oo. The control- 
lability grammian (resp. observability) has maximum rank when the system is 
controllable (resp. observable). The number of uncontrollable states (resp. un- 
observable) corresponds to the number of singular values smaller than a given 
threshold e. 

The controllability grammian is the solution of the Lyapunov equation 

AGc + GcA^ + BB^ = 0 (7.53) 

and the observability grammian is the solution of the Lyapunov equation 

AGo + GoA^ + = 0 (7.54) 

For a discrete system with constant coefficients, the transient controllability 
grammian is defined by 

n— 1 

Gc,n = A^BB^A^' , n > 1 (7.55) 

^=0 

^The adjoint matrix A*is the transposed conjugate matrix of A, equal to the transposed 
matrix A^ of A in the classical case of real-coefficient systems. For this reason, equations 
concerning the grammian are sometimes presented with the adjoint matrix instead of the 
transposed one, which has here been retained. 
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and the transient observability grammian by 

n— 1 

Go,n = ^ , n > 1 (7.56) 

^=0 



Similarly to the continuous case, these transient grammians have as a limit 
the controllability or observability grammians when n ^ oo. The condition of 
controllability or of observability is that the corresponding grammian, which is 
a symmetrical positive matrix, is a dehned matrix^. 

The physical meaning of the notion of grammian appears when the energy 
necessary to make a system go from a state = 0 to a state is considered, 
which is equal to 

n— 1 

En = '^ufui = x'^{CnC'^)~^Xn (7-57) 

^=0 



with 



Cn =[B AB ... A^-^B] 

Gc,n = 

As the grammian Gc,n can be dehned by the recurrence 



(7.58) 



Gc,n+i=AGc,nA^+ BB^ 



(7.59) 



the controllability grammian is the solution of the Lyapunov equation 

Gc = AGcA^ + B B'^ (7.60) 



Similarly, the observability grammian is the solution of the Lyapunov equation 

Go = AGo A^ + C C'^ (7.61) 



The grammians allow us to calculate the balanced realization corresponding 
to a given system. A realization is said to be balanced when it possesses 
diagonal and equal controllability and observability grammians. It is then 
possible to proceed to a model reduction by comparing the orders of magnitude 
of the diagonal terms of the grammian and by neglecting those which are lower 
than a given threshold. The model reduction hnds its application in particular 
in the solving of iLoo synthesis problems (Maciejowski, 1989). 

Example 7.3: Balanced Realization and Model Reduction 

Consider the continuous-time system having three poles of relatively different 
order of magnitude, described by the following transfer function 



G{s) 



1 

(5 + 5)(5 + 1)(5 + 0.1) 



(7.62) 



Its state-space representation is 



" -6.1 
1 


-5.6 

0 


-0.5 ■ 
0 


; B = 


■ 1 ■ 
0 


; C = [ 0 0 1 ] (7.63) 


0 


1 


0 




0 





'^The symmetrical matrix A is defined if Ax = 0 x = 0. 
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The controllability and observability matrices of this system are respectively 
equal to 



" 1 


-6.1 


31.61 ■ 




■ 0 


0 


1 ■ 


0 


1 


-6.1 


; 0 = 


0 


1 


0 


0 


0 


1 




1 


0 


0 



(7.64) 



and both have rank 3. 

The controllability and observability grammians of system (7.63) are res- 
pectively equal to 



0.0832 


0 


-0.0149 ■ 




■ 0.1812 


1.1055 


1 


0 


0.0149 


0 


; Go = 


1.1055 


6.7582 


6.1906 


-0.0149 


0 


0.1812 




1 


6.1906 


6.1527 



(7.65) 



The balanced realization calculated for this system (A, 5, C) gives the new 
system 



" -0.0692 


-0.1620 


-0.0527 ■ 




0.3887 


0.1620 


-0.9213 


-0.6722 


; Be = 


-0.4160 


-0.0527 


0.6722 


-5.1095 




0.1483 



Ce = [ 0.3887 0.4160 0.1483 ' 



(7.66) 



and the grammian corresponding to this balanced realization is the diagonal 
matrix, the elements of which are ordered by decreasing order of magnitude 



Ge 



1.0918 0 0 

0 0.0939 0 

0 0 0.0022 



(7.67) 



To proceed to a model reduction, the diagonal elements are compared to the 
first element. A first reduction is performed, choosing as a criterion to eliminate 
the components g{i) lower than ^(1)/100. Thus, the first reduced model is 
obtained 






■ -0.0686 


-0.1690 ■ 


5 Brpl 


0.3871 


0.1690 


-1.0097 


-0.4355 



' 0.3871 



and the corresponding transfer function 



0.4355 ] 
(7.68) 



0.004352- 0.03515 + 0.1957 
52 + 1.07845 + 0.0979 



(7.69) 



which presents the following poles: —0.9784 and —0.1, and zeros: 4.0834 ± 
5. 3665 j. The fast pole of the original system has thus been cancelled and the 
two remaining poles are very close to the corresponding poles of the complete 
system. On the other hand, two zeros have been added. 

A second reduction is performed by taking as a criterion to eliminate com- 
ponents g{i) lower than ^(1)/10. The second reduced model is obtained 



Ar2 = [ -0.0969 ] ; Br2 = [ 0.46 ] ; C 2 = [ 0.46 ] (7.70) 
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and the corresponding transfer function 



Gr2{s) 



-0.18355 + 0.1938 
5 + 0.0969 



(7.71) 



which presents the following pole: —0.0969 and zero: 1.0561. The two faster 
poles of the origin system have been cancelled. One zero has been added. 




Figure 7.5: Model reduction: Response to a unit step, for the complete system 
and two reduced systems 

A step response has been imposed on the original system, as well as on both re- 
duced systems (Fig. 7.5). When only the fastest pole is cancelled, the response 
of this reduced second-order system is totally superposed to the response of the 
origin system (curve with inflection), while when two poles are cancelled, at 
very short times there appears a small deviation of the response (curve without 
inflection for the reduced hrst-order system), then the response becomes very 
close to the original one. 



7.5 Remark on Controllability and Observabi- 
lity in Discrete Time 

Assume that the continuous-time system is controllable and observable. A suf- 
hcient condition for the associated discrete-time system to be also controllable 
and observable is that the eigenvalues \ of the continuous state matrix A is 
such that: 
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• When lZe(Xi — Xj) =0 with i ^ j 

• Then Xm(Xi — Xj) ^ k being a nonzero relative integer, where Tg is 
the sampling period. 
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Part II 

Multivariable Control 




Chapter 8 

Multivariable Control by 
Transfer Function Matrix 

8.1 Introduction 

The transfer function matrices are frequently used in chemical engineering, 
in particular because of the identihcation methods generally employed in this 
domain. In the present chapter, with the system being represented by a transfer 
function matrix, general characteristics relative to multi-input multi-output 
(MIMO) systems and to multivariable control are treated, such as the stability, 
interaction and decoupling. 

Other chapters in the book using, in general, state-space modelling treat 
particular types of multivariable control: linear quadratic control and linear 
quadratic Gaussian control in Chap. 14, model predictive control in Chap. 
16 and nonlinear multivariable control in Chap. 17. Notice that the state- 
space approach is more natural and appropriate for multivariable use than the 
approach by transfer functions. The description of a linear state-space system 
is essentially realized in Chap. 7 together with the notions of controllability 
and observability. 

Multivariable control by PID controllers and multivariable internal model 
control making use of transfer functions are nevertheless treated in this chapter. 



8.2 Representation of a Multivariable Process 
by Transfer Function Matrix 

A multivariable process admits riu inputs and Uy outputs. In general, the 
number of inputs should be larger than or equal to the number of outputs so 
that the process is controllable. Thus, we will assume riu > Tiy. 

The system is supposed to have been identihed in continuous time by trans- 
fer functions. In general, this identihcation is performed by sequentially impos- 
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ing signals such as steps on each input Ui (i = 1, . . . , and recording the cor- 
responding vector of the responses i/ij {j = 1, . . . , n^). From each input-output 
couple {ui^tjij), a transfer function is deduced by a least-squares procedure. 

In open loop, the Uy outputs yi are linked to the riu inputs Uj and to the 
rid disturbances dk by the following set of riy linear equations 

Id = Ui + . . . + Gulriu Un^ + Gdll Di Gdlrid G^rid 

^riy — Gufiyi JJ\ T . . . T GuYiyYi^ + Gdriyl Di GduyUd Dud 

(8.1) 

which will be written in open loop under condensed matrix form as 

Y = GuU^GdD (8.2) 

where y is the output vector, u is the input vector and d is the disturbance 
vector (the modelled disturbances), Gu is the rectangular matrix riy x riu^ 
the elements of which are the input-output transfer functions, and Gd is the 
rectangular matrix riy x rid ^ tbe elements of which are the disturbance-output 
transfer functions. The diagonal elements Guu of matrix Gu represent the 
principal effects between the inputs and the outputs, while the non-diagonal 
elements represent the couplings. 

From now on, it will be assumed, as in most frequent cases, that the number 
of inputs is equal to the number of outputs: the system is called square. In 
closed loop, it is also assumed that the control makes use of as many feedback 
controllers as inputs. Let C be the controller transfer function matrix. Indus- 
trially, it is common that a multivariable process is, in fact, only controlled 
by single-input single-output controllers; in this case, the matrix C contains 
only diagonal elements. It is also assumed that the actuators are represented 
by a transfer function matrix Ga and the measurement devices by a transfer 
function matrix Gm. 

The multivariable process can be represented by a block diagram, where 
each block symbolizes a transfer function matrix (Fig. 8.1). 

The equations are the following 

Y =GuU^GdD 

= GuGaCE^GdD (8.3) 

= GuGaC {Km Yr ~GmY)^GdD 

giving the closed-loop matrix relation 

Y = [I +GuGaCGra]-^GuGaCKraYr + [I +GuGaCGra]-^GdD 

(8.4) 

In the previous expression, the matrix appearing by its inverse 

^y(^) = I Gu Ga c Gm (S-^) 

is called the feedback difference matrix measured at the output (Macfarlane 
and Belletrutti, 1973). 
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Figure 8.1: Block diagram of a multivariable process 



The matrix 

Ty{s)=GuGaCG^ ( 8 . 6 ) 

is called the feedback ratio matrix measured at the output. 

The feedback differences and feedback ratios with respect to the input and 
to the error are also defined by 

Tu{s) =GaCGmGu 
T,{s) =GmGuGaC 
Fu{s) =I^Tu{s) 

F,{s) =I ^T,{s) 

The characteristic equation is thus the expression of the cancellation of the 
feedback difference matrix determinant 

det[-Fy(5)] = det[J + Gu Ga C Gj^] = 0 (S-S) 

In fact, it can be verified that 

det[F^(5)] = det[F^(5)] = det[Fe(5)] (8.9) 

The closed-loop matrix transfer function Gbf{s) relating l^r(5) and 1 ^( 5 ) 
is equal to 

Gbf{s) =F~\s)GuGaG 

= GuFZ\s)GaG ( 8 . 10 ) 

= G„ GaGF;\s) 

and the matrix 

Goi{s)=GuGaC (8.11) 



is called the open-loop transfer function matrix. 
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8.3 Stability Study 

As in the single-input single-output case, the roots of the characteristic Eq. 
(8.8) must be located in the complex left half-plane so that the system is 
closed-loop stable. Several methods allow us to ensure this (Macfarlane and 
Belletrutti, 1973; Maciejowski, 1989). By definition, a transfer function matrix 
is exponentially stable if and only if it is proper and has no poles in the right 
half-plane. 

These results can be transposed from continuous to discrete time by replac- 
ing the right half-plane by the inside of the unit circle. 

8.3.1 Smith-McMillan Form 

Let G{s) be a transfer function matrix, which is not necessarily square, of rank 
r. By a series of operations on the rows and the columns (Maciejowski, 1989), 
G{s) can be transformed into a pseudo-diagonal transfer function matrix AT(s), 
such that 

M(s) =diag{y^,...,y^,0, ...,0} (8.12) 

-|/'l(s) V'r(s) 

where the polynomials e^(s) and are monic, coprime and satisfy the di- 

visibility property 



e^(s) divides e^+i(s) i = 1, . . . , r - 1 . . 

divides i = l,...,r— 1 ^ 

M{s) is the Smith-McMillan form of G{s). 

8.3.2 Poles and Zeros of a Transfer Function Matrix 

Given the Smith-McMillan form of the transfer function matrix G(s), the roots 
of the polynomial product of the ipi 

P{s) = '0i(s) . . . I/Jris) (8.14) 

are the poles of G{s) and the roots of the polynomial product of the 

Z{s) = ei{s) . . . er{s) (8.15) 



are the zeros of G{s). 

8.3.3 Generalized Nyquist Criterion 

For a single-input single-output system, Cauchy’s theorem and the resulting 
Nyquist criterion have been already commented on in Sect. 5.7. 

For a multi-input multi-output system, square transfer function matrices 
will be considered (same numbers of inputs and outputs). 
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From the characteristic Eq. (8.8), the useful complex variable function for 
the Nyquist criterion is 



F{s) = det[/ +GuGaC G^] (8.16) 

Suppose that this function possesses Z zeros and P poles in the right half-plane. 
As for single-input single-output systems, the algebraic number of encirclings 
(counted positively clockwise, negatively in the opposite sense) of the origin, 
when 5 describes the Nyquist contour, is equal to 

N = Z-P (8.17) 

For the closed-loop system to be stable, it is necessary that: Z = 0. 

Frequently, the open-loop process is stable, thus the transfer functions of 
matrix Gu have their poles in the left half-plane. It is assumed that the actua- 
tors and measurement devices are also stable. Moreover, the chosen controllers 
are also (preferably) stable. Assuming that the transfer functions are expressed 
in the form of rational fractions, the poles of T{s) = Gu Ga C Gm are those of 
the function F{s), so that if the process is open-loop stable, the function F{s) 
has all its poles in the left half-plane. It follows that the number of zeros of 
this function in the right half-plane is then equal to the number of encirclings 
of the origin. In these conditions (stability of Gu, Ga, C, Gm), the important 
conclusion is that if the function det[J -\-Gu Ga C Gm] encircles the origin, the 
process is closed-loop unstable. 

In order to keep the analogy with the single-input single-output systems, in 
fact, the function is represented by 

F{joj) = —I + det[J + Gu Ga C Gm] (8.18) 

and the number of encirclings of the point (—1,0) is examined when the pul- 
sation u varies from — oo to +oo. 

8.3.4 Characteristic Loci 

For a large-scale system, the Nyquist multivariable locus (Desoer and Wang, 
1980; Macfarlane and Belletrutti, 1973), can be complicated and difficult to in- 
terpret. It is interesting to plot the eigenvalue loci, also called the characteristic 
loci. 

Suppose that all the controllers are identical and equal to a steady-state 
gain k. The function F{s) becomes equal to 

F[s) = det[J + k Gu Ga Gm] (8.19) 

The poles of F(s) are the poles of GuGaGm- The condition of stability is then 
that the trajectory of F{s) encircles the origin P times anti-clockwise, when s 
describes the Nyquist contour. 

It could be thought that it is necessary to draw the locus of F{s) for each 
value of k. Indeed, if \i{s) is an eigenvalue of GuGaGm, k\i{s) is an eigenvalue 
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of kGuGaGm and l-\-kXi{s) is an eigenvalue of I -\-kGuGaGm- As the function 
F{s) is a determinant, it verifies 

F(s) =n[l + ^Ai(s)] (8.20) 



thus 

arg[F(s)] = arg[l + fcAj(s)] (8.21) 

i 

The number of encirclings of the origin of 1 + kXi{s) could be counted; it is 
preferable to count the encirclings of the point (—1,0) of kXi{s). The graphs 
are, in general, drawn for /c = 1 when co varies from — oo to Too and the graphs 
of Xi{s) are called the characteristic loci. Individually, these graphs do not form 
a closed curve but, as a collective, they constitute a set of closed loops, as the 
imaginary eigenvalues are necessarily conjugate. 

The generalized Nyquist criterion is thus formulated: 

If the function Gu{s)Ga{s)Gm{s) possesses P unstable poles, the closed-loop 
system with the return difference kGu{s)Ga{s)Gjri{s) is stable if and only if the 
set of the characteristic loci encircles the point (—1,0) P times anti-clockwise 
(assuming that no hidden unstable modes exist). 

To go from the characteristic loci obtained with k = 1 to those correspon- 
ding to any gain, it is sufficient to perform an homothety of centre 0 and ratio 
k. The limits of stability have been determined with k = 1 hj examining the 
possible encirclings; the limit values of the gain k are easily deduced. The 
encirclings by G{s) of the point (— l//c,0) could also be considered. 

The generalized inverse Nyquist criterion is obtained by reasoning with 
respect to GqI~^ ( s) = [G^^(5)Ga(5)G^(5)]“^. If Xi{s) is an eigenvalue of G ( 5 ), 
1/A^(5) is an eigenvalue of G~^{s). The generalized inverse Nyquist criterion 
states that if G{s) has Z unstable transmission zeros, the closed loop system is 
stable if and only if the set of the characteristic loci of G~^{s)/k encircles the 
point (—1,0) Z times anti-clockwise (assuming that no hidden unstable modes 
exist). 

8.3.5 Gershgorin Circles 

According to the Gershgorin theorem, given a complex matrix A of dimension 
(n X n), its eigenvalues A are at least located in one of the discs defined by 

n 
i = i 

and in one of the discs defined by 

n 

I A — \aji\ 

i = i 



(8.23) 
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A Nyquist array of Goi (s) is obtained by drawing an array of the Nyquist 
loci of all the elements gij{s)^ where each locus referenced {i^j) occupies the 
position (i, j) in the array of the graphs. 

The diagonal graphs, thus corresponding to the elements gn{s)^ are con- 
sidered. In each point of the graph of gn{s)^ a circle can be placed with the 
following radius 



or (8.24) 

i=i i=i 

The set of these circles forms a band enclosing the locus of called the 

Gershgorin band, and the circles are the Gershgorin circles. 

If the Gershgorin bands of a matrix do not contain the origin, this matrix 
is diagonally dominant. The more diagonally dominant a matrix is, the nar- 
rower the Gershgorin bands are. To obtain the stability, it is necessary that 
[I + Goi{s)] is diagonally dominant. It is considered that [I + Goi{s)] is dia- 
gonally dominant if the Gershgorin bands of Goi{s) do not encircle the point 
(—1,0). The more diagonally dominant Goi{s) or [I + Goi{s)] is, the closer 
the multivariable system Goi{s) is to a set of single-input single-output sys- 
tems without interaction. In the case where no Gershgorin bands of Goi{s) 
contain the point (—1,0), the closed-loop stability will be checked by counting 
the number of encirclings of the point (—1,0) by the bands, which is equivalent 
to using the characteristic loci for this particular case. However, one or several 
bands can contain the point (—1,0) without the system being unstable. 

Similarly, the generalized inverse Nyquist criterion allows us to test the 
stability: 

If all Gershgorin bands of G~i^{s) exclude the point (—1,0), the stability will 
be checked by counting the number of encirclings of the point (—1,0) by the 
bands. 

Rosenbrock (Maciejowski, 1989) gave a necessary and sufficient condition 
of stability: 

Gonsider a system with riu inputs and riu outputs. Let K = diag{/ci, . . . , kn^} 
be the matrix of the controller gains. Suppose that the inequality 



gu{s) + Ty 

tZj 



>EI5*.(s)| 



\/i 



(8.25) 



is verihed for all s of the Nyquist contour and that the i-th Gershgorin band 
of G{s) encircles the point (— l//c^,0) Ni times anti-clockwise. The closed-loop 
system with return difference —G{s)K is stable if and only if 



^ V = P (8.26) 



(assuming that no hidden unstable modes exist). 
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8.3.6 Niederlinski Index 



This method can be applied only when PI controllers are used in the control 
loops. The method is especially used to avoid pairings of variables (an input 
linked to an output) which would be unsatisfactory. It also allows us to test 
the stability in some cases. Grosdidier et al. (1985) corrected the condition of 
Niederlinski. The Niederlinski index uses only steady- state gains of the transfer 
function matrix of the process Gu- Suppose that Gu^ Ga and Gm are stable 
and note that G = GuGa- The controller matrix C is diagonal and the gain of 
each controller is positive. The matrix GqI = GC is rational and proper. All 
control systems taken individually are stable. The Niederlinski index is equal 
to 



det[G(0)] 

Gii{0) 



(8.27) 



where Gij is the transfer function relating the input j and the output i. If 
this index is negative, the system will be unstable whatever the tuning of the 
controllers. If it is positive, it is impossible to conclude. Thus, it is a sufficient 
condition, except for multivariable systems of size lower than or equal to 2, 
where it is also necessary. 



8.4 Interaction and Decoupling 

A multivariable system presents the particularity that the inputs are coupled 
to the outputs. Different methods exist, allowing us to ensure at least a partial 
decoupling for a multivariable system. This is particularly important in the 
treatment by a transfer function matrix, but does not intervene as explicitly 
in linear quadratic Gaussian control (Ghap. 14) or in model predictive control 
(Ghap. 16). Most of the methods presented here are based on the fact that 
the interaction is undesired. But if the fact that a set point variation acts 
simultaneously on several outputs is annoying, this is not necessarily the case 
concerning the disturbance action on the outputs. In general, the most impor- 
tant point is that the process outputs follow the set point correctly, whatever 
the disturbances. A decoupling based on the set points in reality can make the 
disturbance rejection more difficult. The different methods presented must al- 
ways be evaluated under these two aspects: set point tracking and disturbance 
rejection. 



8.4.1 Decoupling for a 2 x 2 System 

The aim of decoupling is to make the outputs yi independent of the variation 
of any input Uj with i ^ j. Thus, for the 2x2 system, the output yi will be 
only influenced by the variations of ixi, thus of the set point yd and not of 1 x 2 - 
This result is obtained by adding, for example, to the controller a decoupling 
element De (Fig. 8.2). 
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Figure 8.2: Interposition of a decoupler between the controller matrix and 
the process matrix 



In the absence of decoupling, the open-loop equations are the following 

= Gull Ui + Gu12 U 2 + Gdi Di . . . . . 

I2 = Gu21 Ui + U2^Gd2D2^... 

or in matrix form 

Y = GuU^GdD (8.29) 

and in closed loop 

Y = [I Gu Ga C Gjri] ^ Gu Ga C Yj. [I + Gu Ga C Gju] ^ Gd D (8.30) 



In the presence of decouplers represented by a matrix De interposed between 
the controller matrix and the process transfer function matrix, this equation 
becomes 



Y = [I +GuGaDeCG^]-^GuGaDeCYr + 

[I +GuGaDeGGm]-^GdD (8.31) 

= Y-p -\- A 2 D 



To avoid the interaction during set point variations, the matrix Ai must be 
diagonal. This type of decoupling that does not take into account disturbances 
is not strongly recommended. The following discussions will detail the reasons 
for the limitations of such decouplers. 

Example 8.1: Application to Wood and Berry Distillation Column 

Wood and Berry (1973) identihed the transfer functions of a distillation column 
considered as a 2 x 2 system, having as inputs the reflux and reboiler vapour flow 
rates and as outputs the distillate and bottom mole fractions. The influence of 
the feed flow rate disturbance has been added to this model (Deshpande and 
Ash, 1988). The complete model is the following 







12.8 exp(— 5 ) —18.9 exp(— 35 ) 


1 _ 


U)(s) ■ 




16.75+1 215 + 1 




R{s) 


n?(s) . 




6.6 exp(— 75 ) —19.4 exp(— 35 ) 










L 10.95+ 1 14.45+ 1 J 


1 



+ 



3.8 exp(— 85 ) 

14.95+1 

4.9 exp(— 35 ) 

13.25+1 



F{s) 



(8.32) 



The chosen steady-state regime is as follows: 
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yo = 0.96, ys = 0.02, R = 1.95 Ib/min, V = 1.71 Ib/min, F = 2.45 Ib/min. 
This system was simulated in closed loop in the presence or in the absence of 
decoupling. The controllers are PI and the tunings are those cited by (Luyben, 
1990), obtained by the method of largest modulus search (Sect. 8.7.1): 

Kri = 0.375 , m = 8.29 , Kr2 = -0.075 , r/2 = 23.6 




Figure 8.3: Wood and Berry distillation column. Closed-loop representation 
without decoupling 

The closed-loop system without decoupling is represented in Fig. 8.3. 

The system is first subjected to a distillate set point variation equal to —0.02 
(Fig. 8.4). The bottom mole fraction strongly decreases before coming back to 
its nominal regime after a long period (about 120 min). Also note the influence 
of the time delays. The same system is subjected to a bottom product set point 
variation equal to 0.01 (Fig. 8.4). The distillate mole fraction is influenced, 
but more weakly than the bottom of the column in the previous case. 

The same system is subjected to a feed flow rate disturbance step of 0.05 
Ib/min occurring after 20 min (Fig. 8.5). The bottom mole fraction is neatly 
more influenced than the top one. 

The closed-loop system with decoupling is represented in Fig. 8.6. The 
decouplers have been calculated by the simplified decoupling. They are equal 
to 

= ; C2. = ^ (8.33) 

Frll Ct22 

As time delays exist, it is important that the decouplers are physically realiz- 
able. For example, hadd the time delay related to Gu been larger than the time 
delay related to G 12 , the time delays would not have been taken into account. 
The closed-loop system with the same controllers but with decoupling is sub- 
jected to the same set point variations and the same disturbance. Figure 8.7 
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Figure 8.4: Wood and Berry distillation column without decoupling. Set 
point step of distillate —0.02 (left column) and set point step at the bottom 
0.01 (right column). Distillate and bottom mole fraction responses 







Figure 8.5: Wood and Berry distillation column without decoupling (left 
column) and with simplihed decoupling (right column). Disturbance step of 
the feed flow rate 0.05 Ib/min occurring after 20 min. Distillate and bottom 
product mole fraction responses 



shows that the decoupling effectively acts by totally cancelling the multivaria- 
ble effect. However, it must be noted that these results have been obtained 
m simulation and assume a perfect model. In reality, the transfer function 
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Figure 8.6: Wood and Berry distillation column. Closed-loop representation 
with decoupling 



is known with uncertainty as well as the time delays. Thus, in practice, an 
influence of the coupling would be observed. 




Figure 8.7: Wood and Berry distillation column with decoupling. Distillate 
set point step —0.02 (left column) and bottom set point step 0.01 (right co- 
lumn). Distillate and bottom product mole fraction responses 



As the decouplers are calculated without taking into account the disturbances. 
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in Fig. 8.5 we note that the system with decoupling is no better with regard 
to disturbances than the system without decoupling; in the present case, it is 
even the contrary. Thus, this controller is not robust. 



8.4.2 Disturbance Rejection 

If a disturbance has no influence on a controlled variable, the modulus of the 
closed-loop transfer function is zero. The objective in the multivariable do- 
main is to choose the best pairings between inputs and outputs so that the 
smallest moduli are obtained. Thus, for a multivariable system of dimen- 
sion m and for each disturbance, the m curves must be plotted in the Bode 
diagram corresponding to the modulus of the m components of the vector 
[I ^ GuGaC Gm] ^ Gd- 

8.4.3 Singular Value Decomposition 

The decomposition of a matrix A of dimension n x p with respect to singular 
values is performed according to 

A = VSW’^ 

where V is an orthonormah matrix n x n, w is an orthonormal matrix p x p, 
and X) is a diagonal matrix n x p with aij = 0 if i ^ j and era = ai > 0. The 
quantities are the singular values of A and the columns Vi of V, respectively 
Wi of TV, are the left singular vectors, respectively right, such that 

AVF, = yi (8.34) 

If A is a square matrix, the matrices A^ A and A A^ have the same nonzero 
eigenvalues. 

The singular values of a rectangular matrix A of dimension n x p are the 
square roots of the eigenvalues of the square matrix A* A and thus are dehned 
by 



[A” A] 

Perform the singular value decomposition G = and arrange the 

singular values in order of decreasing value: > ... > • The columns 

of U denoted by f/i, . . . , Un^ and the columns of Y denoted by Ti, . . . , 

^ A matrix A is orthonormal when it is: 

1) Orthogonal: the set of vectors (columns or rows) composing the matrix is orthogonal, thus 
these vectors are orthogonal between themselves. 

2) These vectors are unitary 



A*A = AA* = I 



where A* represents the transposed conjugate matrix of a matrix A. 
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respectively constitute the principal directions of the inputs and the outputs 
of G. With the matrix U being orthonormal, the principal directions Ui are 
orthogonal between themselves; the same applies to the Yj. If the input is 
chosen in the direction of so ix = alii with the scalar a such that \a\ = 1, 
from the equality y = Gu, results 



y = aaiYi (8.35) 

the corresponding output is in direction Yi and the gain of the system is cr^. 
With each principal gain is thus associated a couple of principal directions. 
Note that, although this calculation can be performed for a non-square system, 
here this simplification is chosen. 

The largest elements (in absolute value) in the columns of Y indicate which 
are the most sensitive inputs (Luyben, 1990). Deshpande and Ash (1988) men- 
tions that the pairing should be realized by pairing the controlled output asso- 
ciated with the largest (in absolute value) element of Y\ with the input asso- 
ciated with the largest (in absolute value) element of l/i, the same applies to 
the following directions 1 / 2 , I 2 , and so on. 

8.4.4 Relative Gain Array 

The interaction between loops can be evaluated by a method based on the 
study of the relative gain array (RGA) introduced by Bristol (1966). The loops 
infiuence themselves in a more or less important manner and a possible effect 
is that some loops destabilize the closed-loop system. The RGA method is 
relatively easy to implement, and for this reason is frequently used in chemical 
engineering. It is limited in its original form, as it uses only steady-state in- 
formation. It can be extended by using frequency representations (Hovd and 
Skogestad, 1992; Skogestad and Morari, 1987b). 

Steady-state Relative Gain Array 




Figure 8.8: Block diagram of an open-loop multivariable 2x2 process 
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The simplest case that can be studied is that of a system with two inputs and 
two outputs considered around a steady state (Fig. 8.8) 




Figure 8.9: Block diagram of a multivariable 2x2 process with a controller 
on loop 2 



Then suppose that only the output i /2 is controlled by installing a controller 
with transfer function C 2 {s) (Fig. 8.9). The system becomes equal to 



U(s) 

y2{s) 




1 + G22(s) 




The transfer function between ui and yi is thus modihed (the subscript c/22 
indicates that the loop between U 2 and y 2 is closed according to 



Gi2{s)C2{s)G2i{s) 
1 + G22(s)G2(s) 



The ratio of the open-loop and closed-loop transfer functions expresses the 
influence of the loop between U 2 and ^ 2 , so 
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Grosdidier et al. (1985) assume, as is frequently done, that the controller con- 
tains an integral action, which implies (^ 2 ( 0 ) = + 00 , and the measure of the 
interaction /in ( 5 ) is denoted by An, which depends only on the steady-state 
gains of the transfer function G, thus 



An = 



Gii{0) G22(0) 



Gn(0) G22(0) — ^ 12 ( 0 ) G2 i(0 ) 



(8.41) 



Explanation in terms of asymptotic variations: 

The previous calculation can be found by reasoning for the two-input two- 
output system of Fig. 8.9. The asymptotic value of a variable is denoted by 
x{oo) to emphasize that time tends towards infinity. When the second loop is 
closed and the regulation is perfect, if a step is executed at ui, it induces an 
action G 2 i( 0 )Ani(oo) at y 2 which must be compensated for by U 2 so that 



(T2 i( 0) Aui(oo) + (722(9)^1x2(00) = 0 



A / \ ^ 21 ( 0 ) A / \ 

Axx2(oo) = Axxi(oo) 



(^ 22 ( 0 ) 



The action of the input variation Axx 2 (oo) at yi is 

(72i(0) 



(7i2(0) 



■ Axxi(oo) 



(^ 22 ( 0 ) 

hence the total variation of yi due to the step at ui 



Ayi(oo) = Gii(O) Ami((X)) - Ami(oo) 



(8.42) 



(8.43) 



(^ 22 ( 0 ) 

(7ii(0) (^22(0 ) — (7i2(0) (72i(0) 



(^ 22 ( 0 ) 



Axxi(oo) 



(8.44) 



The closed-loop ratio results 

/ Ayi{oo) A _ ( 7 ii(Q) (^22(0) — ( 7 i 2 (Q) G 2 i{ 0 ) 
\Aui{oo)J^^ ^22(0) 

which, when coupled to the open-loop ratio 

(fy^i) =Gn(0) 

gives the first component of the steady-state relative gain array 

f Ayi(oo)' 



(8.45) 



(8.46) 



All = 



\Au,{ 00 ) 

/ Ayi(oo) 
\Aui{oo)J^i 



Gii(O) G22(0) 



Gii(O) G22(0) — Gi2(0) G2i(0) 



(8.47) 



The other elements of the steady-state relative gain array can be easily deduced 
by means of relation (8.50). 
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By generalizing for a larger dimension of the multivariable system, any 
element \j of the relative gain array A is equal to the ratio of the steady- 
state gain between the i-th output variable i/i and the j-th input variable when 
the system is free of any control over the steady-state gain between the same 
variables when the system is feedback-controlled by the other inputs, assuming 
that in the steady state all the other outputs are maintained at their nominal 
value (the controllers are integral) 



Kj — 



dY, 



dU, 



3 / 



dY, 



dU, 



3 / Yi=0,l^i 



(8.48) 



assuming that the matrix , obtained from the matrix G by suppressing the 
i-th row and the j-th column, is non-singular. 

Denoting by Gij the element (i^j) of matrix G~^, the elements Xij of the 
relative gain array can be calculated (Shinskey, 1988) by the following relation 



Xij — Gij Gji (8.49) 

The elements Xij verify the relation 

flu flu 

= (8.50) 

i=l j=l 

For a perfectly decoupled system, the relative gain array A becomes the 
identity matrix. Theoretically, the objective is that the associations (principal 
effects) between an output variable and an input variable are such that the 
elements of the concerned matrix are positive and are close to 1 (Bristol, 1966). 
When an element (in absolute value) is large, it means that the interaction is 
important. 

For a MIMO system, the general recommendations (Seborg et ah, 1989; 
Shinskey, 1988) are: 

• If Xij < 0, the gain of the concerned pairing changes its sign when passing 
from open to closed loop, yj should not be paired with Ui. For a two-input 
two-output system, if An < 0, yi should not be paired with ui. 

• If Xij ~ 0, the interaction is low, yi is liitemttle inffuenced by Uj. For a 
two-input two-output system, if An ~ 0, y 2 should be paired with ui. 

• If 0 < A^j < 1, the closed-loop gain uj — yi is larger than the open-loop 
gain and the interaction between the loops is maximum for Xij =0.5. For a 
two-input two-output system, yi should be paired with ui only if An > 0.5 . 

• If Xij > 1 , the interaction is high and the degree of interaction between the 
loops increases with Xij. 

Integral Stahilizahility 

The use of the relative gain array assumes that the controllers comprise an 
integral action and can be decomposed into a matrix of integrators k/sl and a 
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Figure 8.10: Integral multivariable control 



matrix of compensators C{s) (Fig. 8.10). Assuming that the matrix H{s) = 
G{s)C{s) is a proper transfer function matrix (limg^oo G{s) = 0), the system 
defined by H{s) is stabilizable (Grosdidier et ah, 1985) by an integral action 
only if 

det(iT(0)) > 0 (8.51) 



Integral Controllability 

The system defined by H{s) is controllable by an integral action only if all 
the eigenvalues of H (0) are in the right complex half-plane. If only one of the 
eigenvalues is located in the left half-plane, the system is not controllable. 
Relation with RGA matrix 

When Xjj{G) < 0, then for any compensator C{s) such that: 

a) G{s)C{s) is proper, 

b) yj infiuences only uj and uj is infiuenced only by yj, resulting in Cji = Cij = 

0 , 

either the closed-loop system is unstable, 

or the loop j is unstable, all the other loops being open, 

or the closed-loop system is unstable when j is taken off. 

Integrity 

Grosdidier et al. (1985) also consider the necessary conditions so that a mul- 
tivariable system remains stable when one of the actuators or sensors breaks 
down (integrity concept) and the coupled output is no more controlled: the 
system is tolerant with respect to a fault of the sensor or the actuator j if 
the complete and the reduced systems obtained by removing this sensor j are 
simultaneously controllable by an integral action. 

Grosdidier et al. (1985), in particular, study the cases of 2 x 2 and 3x3 
systems for which many incomplete results had been previously obtained. Thus, 
for a 2 X 2 system with An >0, there exists a diagonal compensator C{s) such 
that the closed-loop system remains stable in spite of the breakdown of a sensor 
or an actuator. For a 3 x 3 system with Xjj > 0, Vj, if there exists a compensator 
C{s) such that H{s) is controllable by an integral action and Hjj(C) = 0, then 
it is possible to maintain the stability of the system in spite of the breakdown. 
For a 2 X 2 system, a multivariable compensator is not necessary, while it should 
be used for a 3 x 3 system. It is recommended to use the concept of integral 
controllability previously exposed. 

Robustness and relative gain array 

The designer wishes that the control system remains stable even when the 
model G{s) that was used to design the multivariable compensator C{s) does 
not represent anymore perfectly the process G{s). Grosdidier et al. (1985) 
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show that the process running with the same compensator C{s) keeps integral 
controllability, provided that 



l|G(^)-G(^)ll . 1 1 

l|G(s)|| ||G(s)|| \\G~\s)W Cond(G) 



(8.52) 



where Cond is the condition number^ of the matrix G. 

A problem is that the condition number depends on the scaling of the inputs 
and the outputs (realized by multiplication of G by diagonal matrices and 
Dy) opposite to the RGA matrix. Nevertheless, it is possible (Grosdidier et ah, 
1985; Skogestad and Morari, 1987a), to realize a scaling so that a minimum or 
optimal condition number 7 * is obtained. For a 2 x 2 system, this minimum 
condition number can be related to the 1 -norm of the RGA matrix by 

7* = ||Ai|| + v/||AiP-1 (8.53) 



and the following conjecture is proposed (Grosdidier et ah, 1985) for a square 
multivariable system of any order 



7 * <2max[||Ai||,||Aoo||] 



(8.54) 



which is verihed for a second-order system. This rejoins the results concerning 
the stability of 2 x 2 systems for which Shinskey (1979) showed that if An < 0 or 
All > 1 , the error of the decoupler necessary to destabilize the system decreases 
when I All I increases although this error has no influence on the stability when 

0 < All < 1. 

Recalling that Gij are the elements of matrix the relative variations 

of Gij, Gij, Xij are related by 



dXij _ ^ ^ dGij 

— [i Aij) 

Ain CT7I 



dXij 

Xii 



Xij 1 dGij 



Xii 



Gi 



(8.55) 



From Eq. (8.55), when some elements of the RGA matrix A are large 
compared to 1, it results that the system is sensitive to modelling errors. If the 

^Numerically, often the ratio of the largest singular value to the smallest singular value 
is considered as a measure of the condition number denoted by 7 (A) of the matrix. When 
the orders of magnitude of the singular values differ strongly, this means that the problem 
is ill-conditioned, or still that the equations of the associated linear system present some 
dependence. For a linear system AX = B, the solution X would vary largely for small 
variations of B. 

A matrix norm subordinate to a vector norm is defined by 

II A|| = maxaj^o 

Thus, the 1-norm denoted by || A||i is the maximum with respect to the columns of the sum 
of the absolute values of the elements of a column, and the oo-norm denoted by ||A||oo is 
the maximum with respect to the rows of the sum of the absolute values of the elements of 
a row. The subordinate matrix norm (sometimes without subscript in this chapter) is the 
Euclidean norm or 2-norm denoted by || A ||2 equal to the largest singular value of A, where 
the Euclidean norm of vector x is equal to ||a3|| = \/x*x. The largest singular value is equal 
to the 2-norm of the matrix. 
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norm of A is large, and from Eq. (8.54) the minimum condition number is large, 
then the system is practically uncontrollable (Skogestad and Morari, 1987b). 
A system is sensitive to uncertainties on the control variable, whose variations 
are never perfectly known, when the process and the controller simultaneously 
have large elements in the RGA matrix (Skogestad and Morari, 1987b). This 
could explain the sensitivity of decouplers, in particular those called ideal or 
simplified, when the process has large elements in the RGA matrix. In the same 
way, the use of a controller based on the inverse of the model must be avoided 
in these conditions. A one-way decoupler (triangular decoupling matrix) is far 
less sensitive to the uncertainty of the control variable and a diagonal decoupler 
is not at all sensitive, but the latter may provide very bad performances. 

For 2x2 systems, the RGA matrix A and the Niederlinski index give the 
same information (Hovd and Skogestad, 1994). 

Pairing rule for integrity 

It is recommended, for stable processes, to choose pairings corresponding to 
positive values of the Niederlinski index and of the RGA matrix A(0). Hovd 
and Skogestad (1994) give indications for processes presenting one or several 
unstable poles. 

Dynamic Relative Gain Array 

It is possible to represent the modulus of the elements of the relative gain 
array A with respect to frequency (Tung and Edgar, 1981). In the following 
robustness analysis, a systematic frequency study is realized. 

For example, a controller having a relative gain array whose modulus of the 
elements is small at all frequencies is in general insensitive to uncertainties of 
the control variable. 

8.4.5 Gershgorin Circles and Interaction 

If the system were perfectly decoupled, the radii of the Gershgorin circles would 
be zero. The larger the radius, the wider the band and the greater the interac- 
tion. By successive approximations of the controllers, it is possible to reduce 
the bandwidth and to decouple the system at the maximum. In spite of all 
this, the problem of disturbance rejection is not present in this method hence 
its limited interest. 



8.5 Multivariable Robustness 



In single-input single-output systems, the modulus of the closed-loop transfer 
function is examined 



Yr (juj) I + Gol (juj) 



(8.56) 



This fiat curve at low frequencies (for which the ratio \ Y (jco) /Y^ (jco) \ is equal to 
I) presents a resonance peak for a given frequency then decreases when the 
frequency increases. The more important the maximum, the less underdamped 
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the system and thus the less robust the system. More generally, the sensitivity 
and complementary sensitivity functions have been introduced and robustness 
criteria have been dehned (Sect. 5.10) in order to account for uncertainties 
either in the system or in the process environment. 




Figure 8.11: Block diagram of a multivariable process for robustness study 

For a multivariable system, robustness has been developed in a similar way 
(Doyle and Stein, 1981; Maciejowski, 1989; Oustaloup, 1994; Palazoglu and 
Arkun, 1985; Skogestad and Postlethwaite, 1996). In particular, we will exam- 
ine the necessary compromise between performance, i.e. the capability of the 
system to follow a reference trajectory, and robustness, whose objective is to 
react correctly in the face of uncertainties. The uncertainties, being generally 
weak at low frequencies, increase largely with frequency. Different types of 
uncertainties can be considered, e.g. additive uncertainty and multiplicative 
uncertainty equivalent to a relative variation. Taking into account an uncer- 
tainty means that the perturbed system keeps some properties, e.g. stability 
or performance, in spite of the undergone variation. 

Consider the system in Fig. 8.11. The output is equal to 

Y = (I-hGC)-^GC(V;^ -N)-h(I-hGC)-^B 

= T(s)Y;+S(s)D-T(s)N 

= (I- S(s)) Y; + S{s) D-{I- S{s)) N ^ > 

= T(s) Y; + (J - T(s)) D - T(s) N 

where the sensitivity function of the output is equal to 

S{s) = [I + G{s) (8.58) 

and the complementary sensitivity function 

T(s) = [J + G(s) C(s)]-i G(s) G(s) 

= S{s)G{s)C{s) (8.59) 

= J - S(s) 

The tracking error is equal to 

E =Y* -Y 

= (J + GG)-1 {Y; -D) + {I + GG)-^GGN 
= S{s) Y; - S{s) D + T(s) N 



(8.60) 
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Denote by am{G) the smallest singular value of a matrix G and by (G) 
the largest singular value, which verifies 

a^(G) = ||G|| (8.61) 



When the singular values are expressed with respect to frequency: cr^(cj), they 
are called the principal gains of G{s). 

Using the singular value decomposition, it can be shown that 



l(w) < 



\\G{jLo)U{ju;)\\ 

WiMW 






(8.62) 



thus the gain of a multivariable system is a value between the smallest and the 
largest principal gain. 

Doyle and Stein (1981) use limits such as lm{^) to represent a boundary 
for uncertainty, low at low frequencies and high at high frequencies. Here, a 
simplified formulation is qualitatively retained by low or high, which in fact 
utilises the same concepts. 

According to Eq. (8.57), the objective is to maintain the sensitivity *S' at a 
low value at low frequencies (for disturbance rejection), and what is important 
is the upper boundary represented by (S) 

{[I G{s) C{s)]~^) low (8.63) 



On the other hand, the complementary sensitivity function T must be of 
a low value at high frequencies (to minimize the measurement noise infiuence) 
and the upper boundary represented by (T) is of interest 

-[I ^G{s)C{s)]-^) low (8.64) 



The compromise between performance and robustness is the inequality 



and: \l-a^^\T)\ < < l+cr''^(T) 

(8.65) 

Define the output bandwidth ujjjy for which 

a^{T){uby) = 1/V2(T^(T)(0) = 1/^2 

In the case without the precompensator P, the system has only one degree 
of freedom and if the transmission bandwidth ujty is designed such that 

(yra{T){LOty) = 1/V2(T„(T)(0) = 1/V2 



then it results that 



^ty ^ ^by 



(8.66) 



and the designer must seek to minimize {coby — bOty) for the set point tracking, 
which amounts to seeking 



dmil -[I + G{s)C{s)] ^) w 1 and {I - [I + G{s) G{s)] ^) w 1 (8.67) 
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In the case with the precompensator P, the system has two degrees of 
freedom; the transmission bandwidth is then determined by the matrix TP, 
giving cr^(TP) and the designer must try to maximize {coty — ^by) for the set 
point tracking, as it is then possible to make oUfy > 

The input is equal to 

U =[I + CG]-^C {Y; - N - D) (8.68) 

Denote the control sensitivity by 



F-\s) = [I + Cis)G{s)]-^ (8.69) 

The variations of the input signal should be maintained as weak thus, (F~^C) 
must be weak. As 

a^{F-^G) < (8.70) 

i) 

it suffices to make cr^(P~^C') small so that the ratio is small, which is only 
possible if (G) 1 or if cr^(C') <C 1. At frequencies where (G) is not 

large compared to 1, cr^(C') must be maintained as low as possible 

am{C) low (8-71) 

In general, it is interesting to express the previous criteria (8.63), (8.64), 
(8.67), (8.71) with respect to the principal gains of the open-loop transfer func- 
tion matrix GqI = GC (Doyle and Stein, 1981; Maciejowski, 1989). 

Disturbance rejection: 

The criterion (8.63) dealing with low sensitivity at low frequencies can be ex- 
pressed again as 



a^i[I + Gis)G{s)]-^) < 



1 

am{G{s)C{s)) 



(8.72) 



thus a low sensitivity is obtained when the smallest principal gain of the open 
loop is large: am{GC) ^ 1. 

Minimization of the measurement noise: 

The criterion (8.64) dealing with the complementary sensitivity low at high 
frequencies can be expressed again as 



a^i[I + Gis)G{s)]-^) 



1 

am{I^[G{s)C{s)]-^) 



(8.73) 



thus a low sensitivity to measurement noise is obtained when the largest prin- 
cipal gain of the open loop is small: {GC) <C 1. 

Set point tracking: 

The criterion (8.67) obtained in the case of a system with only one degree of 
freedom can be expressed again as 



J-[J + G(s)C'(s)]-i 



1 



(8.74) 
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which can be transformed into cr^(GC) 1. 

Small input variations: 

The criterion (8.71) gave (C) <C 1. 

The apparent conflicts between these four demands can be solved by con- 
sidering the frequency domains where each of them must be applied. 

The closed-loop performance can be deduced from the moduli of the char- 
acteristic loci when the matrix GC is normal^. If the matrix GC is not quite 
normal, the characteristic loci can still be used. If the deviation with respect 
to normality which can be calculated according to Maciejowski (1989) is large, 
the only indication is 

(^m < |Aj|cr^ (8.75) 




Figure 8.12: Specification representation according to the open-loop transfer 
function matrix GC of a multivariable process 

The loci of the eigenvalues of GC can be drawn in the classical Bode represen- 
tation and interpreted in the usual manner in order to deduce considerations 
of gain and phase 

|det(GC)| > for: cv < cvi with: am{GC) > L 

|det(GC)| < for: uo > UO 2 with: {GC) < e ^6) 

diigXAGC) < — — ^^s(-^A) open-loop stable 

2 log(cJ2/^l) 



matrix A is normal when it verifies A* A = AA* . 
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This last equation allows us to obtain the following limit 



20 log(L/e) 
log(a;2/a;i) 



< 40dB/decade 



(8.77) 



which, if it is not verihed, leads to difficulties in maintaining the closed-loop 
stability. 

The singular values cr^(GC') and (GC) previously dehned can also be 
drawn with respect to frequency in a similar way to a Bode diagram; these will 
be the loci of the singular values. The specihcations (Fig. 8.12) dealing with 
the smallest and largest principal gains and the frequency bands can be applied 
to build the specihcations (Doyle and Stein, 1981; Maciejowski, 1989) of GC 
similarly to those dehned during the robustness study for single-input single- 
output systems (Kwakernaak, 1993). The frequency constraints provided by 
the criteria to respect are then approximated by continuous transfer functions 
of simple expression which produce frequency weightings Wi and W 2 such that 
the original specihcations (8.63) and (8.64) become 



||lTi^||oo<l and: ||lT 2 T||oo < 1 (8.78) 



8.6 Robustness Study of a 2 x 2 Distillation 
Column 

Example 8.2: Robustness Analysis of a Distillation Column 

As an applied example of robustness analysis of a 2 x 2 system, consider the 
study of a distillation column by Arkun et al. (1984). The comments are taken 
from this paper. Refer also to the example developed in Sect. 20.4.2. 

Denote by De the decoupling matrix of the system having transfer function 
matrix G (Fig. 8.2). The designer of the decoupler wishes that the following 
equation to be satished 

G{s)De{s) = M{s) (8.79) 

where M{s) is a diagonal matrix specihed by the designer. 



8.6.1 Simplified Decoupling Analysis 

Consider the matrix M 

Gi 2 C21' 



M = 



G 



11 



1 - 



Gii G22 



0 



G 



the matrix G being 



Gii 

G21 



22 



Gi2 

G22 



1 - 



C 12 C. 



12 Ct21 

Gii G22 



(8.80) 



G = 



(8.81) 
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A simplified decoupling is given by the decoupling matrix 

- ^ _G^- 

De = ^11 (8.82) 

- G 22 

which results in the following output equations 

Yi = Gii (l - Ui 

) (8.83) 

U = G22 ( 1 - ) C, 

V FrllGr22/ 

As the minimum singular value cFm[I + Goi{joo)] > 0, the system is stable. 
Nevertheless, if both minimum and maximum singular values are lower than 1, 
the system is not robust with respect to sensitivity. Thus, high-purity distilla- 
tion columns are more sensitive to modelling errors and the stability margin is 
smaller than that for low-purity columns (Arkun et ah, 1984). 

8.6.2 Ideal Decoupling Analysis 

Consider the matrix M 

The ideal decoupling is given by the decoupling matrix 
^ Gii 

^ _ G\2 G21 ^ _ G\2 G21 

= G1XG22 G11G22 



I 1-^ 

L Gn( 

which results in the output equations 



G22 

G\2 G 21 
Gii G 22 



G\2 G21 
Gii G 22 



Yi — GiiUi ^8 

>2 = G22U2 

By examining the singular values (Arkun et ah, 1984), it can be shown that 
the simplified decoupling works better than the ideal decoupling. 

8.6.3 One-Way Decoupling Analysis 

Consider the matrix M 



G 12 G 

G,,G 



G12’ 

^22 / 

G22 



(8.87) 
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The one-way decoupling is given by the matrix decoupling 



De 



G 21 

G22 



0 

1 



which results in the output equations 



Id 

I2 



G 



11 



1 - 



G22U2 



Gi 2 G21 

Gii G22 



Ui + Gi2 U 2 



(8.88) 



(8.89) 



8.6.4 Comparison of the Three Previous Decouplings 

Note that 

lm{^) =cTm[I + [G{j^) De{juj) (8.90) 

is the maximum tolerable degree of uncertainty. As Im is maximum for the 
simplihed decoupling, this simplihed decoupling can tolerate more uncertainty 
than the other types of decoupling which were mentioned. 

For a low-purity distillation column, all the decoupling schemes improve the 
robustness. For a high-purity column, the ideal decoupling makes the column 
less robust. 



8.7 Synthesis of a Multivariable Controller 

In all cases, hrst a choice of the manipulated variables and of the controlled 
variables must be realized. The reflection on the variable pairing can be partly 
based on the Niederlinski index and on the analysis of the relative gain array 
(RGA) (Hovd and Skogestad, 1994). It is possible to eliminate some impossible 
variable pairings by use of the Niederlinski index. The best pairings among 
those possible must be chosen by applying methods such as the RGA. Then, 
the synthesis of a multivariable control can be performed at very different 
complexity levels. 

Thus, the multivariable aspect may be ignored by closing the loops individ- 
ually at their turn: hrst a loop is closed while allowing m — 1 open loops, then 
a second loop while allowing m — 2 open loops, and so on. The multivariable 
controller is then diagonal. The method of the largest modulus proposed in 
Sect. 8.7.1 can be considered to be derived from this procedure. 

Except for this case, if a real multivariable controller is designed, the method 
of the characteristic loci allows us to approach the control system. 

The robustness analysis based on the study of the singular values should 
complete the synthesis in all cases, even if the study has been voluntarily simpli- 
hed. The pairing could be chosen as the one which gives the smallest modulus 
for the closed-loop transfer function relative to the disturbance. 
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8.7.1 Controller Tuning by the Largest Modulus Method 

The following method (Luyben, 1990, 1986) of search of the largest modulus is 
relatively simple to realize, presents the advantage of being easily understood 
and is parent to the Ziegler-Nichols method, which was used for single-input 
single-output controllers. It is decomposed into four stages: 

• Calculation of the parameters of the PI controller according to the Ziegler- 
Nichols method for each individual loop (ultimate frequency uOu and ultimate 
gain of each diagonal transfer function Ga] at the ultimate frequency, the 
phase angle is — tt and the ultimate gain is then the inverse of the real 
part of Gii. The gain of the controller recommended by Luyben (1990) is 
Kci = Kuil‘2.2 and the integral time constant is Td = 27r/1.2iOui- 

• A detuning factor F > 1 (from about 1.5 to 4) is chosen. All the controller 
gains are divided by F: Kd = KzNijF (a safety margin is ensured). The 
controller time constants are multiplied by the same factor F\ Td = tzni^F 
(also a safety margin by slowing the response). 

Represent the function 

W{juj) = -lFBet[lFGoi] (8.91) 

in the complex plane. The closer the function is to the Nyquist point (—1,0), 
the closer it is to instability. By analogy with closed-loop SISO systems, the 
closed-loop multivariable modulus (here expressed in decibels) is defined 

W 

L = 201og|^^| (8.92) 

which presents a maximum with respect to frequency. 

• The detuning factor is varied until the maximum of L: Lmax is equal to 
2n, n being the order of the multivariable system: for a SISO system, this 
corresponds to the usual recommendation Lmax = 2dB. 

This method of the largest modulus guarantees not only the stability of the 
control system in its environment, but also that of each controller considered 
individually. Following this method of the largest modulus, it is quite possible 
to pursue the improvement of the control system. 

8.7.2 Controller Tuning by the Characteristic Loci Method 

Maciejowski (1989) proposes the method of the characteristic loci in agree- 
ment with Grosdidier et al. (1985) (Fig. 8.10), which can be decomposed into 
successive stages 

• Calculate a constant compensator Ch ~ (corresponding to high 

frequencies) where is the loop passband, 

• Calculate an approximately commutative compensator Cm{s) at a medium 
frequency cOrn < for the compensated process G{s)Ch such that Cm{jbo) 

I when cj ^ oo (to avoid the infiuence of the high-frequency noise on the 
decoupling realized by Cm{s)). In this stage, the characteristic loci are con- 
sidered as well as the behaviour in the neighbourhood of the critical point 
(-1,0). 
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• In order to compensate for the possible steady-state errors, design an ap- 
proximately commutative compensator Ci{s) at a low frequency oui < oUrn 
for the compensated process G{s)C hCm{s) such that Ci{jio) I when 
uj ^ oo. The integral action integral can be introduced in Ci{s). 

• The complete compensator is equal to 

C{s) = ChCm{s)C,{s) (8.93) 



8.8 Discrete Multivariable Internal Model 
Control 

Among the different model-based controls can be found model algorithmic con- 
trol, dynamic matrix control and internal model control. However, Garcia and 
Morari (1985) showed that the first two types of control can be classified under 
internal model control (Fig. 8.13). 




Figure 8.13: Discrete multivariable internal model control 

Internal model control is developed for multivariable systems (Garcia and 
Morari, 1985) in the same way as for single-input single-output systems. Mul- 
tivariable internal model control is here presented with reference to the z- 
transform. If continuous transfer function matrices in s were used, the reason- 
ing would be similar, provided that the respective conditions of stability were 
kept. The input vector u of the system is equal to 

U=[I + C{z) (G(z) - G(z))]-1 G(z) {Yr{z) - D{z)) (8.94) 

and the output vector 

Y = G{z) [I + C{z) (G(z) - G(z))]-1 G(z) {Yr{z) - D{z)) + D{z) (8.95) 

If the model G and the process G{z) coincide perfectly, these equations are 
reduced to 

U = C(z) (Yr(z) - n(z)) (8.96) 
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and 

Y = G(z) C{z){Yr{z) - D{z)) + D{z) (8.97) 

The ideal controller corresponding to a zero steady-state error and a perfect 
disturbance rejection is obtained by taking 

C{z) = [G(z)]-i ^ C{z) = [G(z)]-i (8.98) 

In fact, such a controller is not physically realizable if the model G{z) 
contains delays or transmission zeros outside the unit circle. Moreover, even if 
this is not the case, if the transmission zeros are too close to unity (by a lower 
value), the controller will not be robust and the output will present oscillations. 

To avoid these problems, a factorization of the model transfer function is 
realized as 

G{z) = G+{z)G-{z) (8.99) 

so that Gj^{z) contains the delays and the transmission zeros. Thus, [G-{z)]~^ 
is a physically realizable controller and the controller is chosen as 

C{z) = [G-{z)]-^ (8.100) 



Delay factorization: 

If all the delays td were identical, the problem would be simple and would result 
in 

G+(z) = (8.101) 

where 1 unit is added to the delay to account for the fact that an output always 
responds to an input variation with a delay of at least one sampling period. 

In general, the delays are different and denoted by tdi so that for n outputs 
it results that 

G+{z) = diag \z~^rn+^) . . . (8.102) 

by setting 

t^. = max[max{0, tdij}] (8.103) 



The delays tdij are found from the inverse of the model matrix G(z)~^ whose 
elements are g^j^z) such that gij(z) is semi-proper (a ratio of polynomials 

gij(z) is said to be semi-proper when the order of the numerator is lower or 
equal to that of the denominator). 

In these conditions, in the absence of modelling errors, the output is equal 



to 



y = G(z)G_(z) ^ (y^{z) - d{z)) + d{z) 
= G+{z) {ydz) - d{z)) + d{z) 



(8.104) 



If the matrix Gj^{z) is diagonal, the closed-loop outputs are decoupled. 
Factorization of the transmission zeros out of the unit circle: 

The controller G- (z) must be stable. With the zeros of the model being poles 
for the controller, it is clear that these zeros must be located out of the unit 
circle. 




Process Control 



309 



G _ (z) is decomposed into a product of two factors and denoted by 



G-(z) — G-i(z) G-^i(z) 



(8.105) 



where G-i(z) contains the zeros inside the unit circle that will be kept for the 
controller, and G-^i(z) contains the zeros outside the unit circle that are added 
to 6+(z). 

To respect the wish that the matrix G-^i{z) is diagonal, the integral of the 
squared error criterion is minimized by the scalar diagonal matrix 

-«(=) = ft (S;) (H;) ' 



where Ui are the transmission zeros and are their images inside the unit circle 
such that 






Ui if \ui\ < 1 

^ if \vi\ > 1 



(8.107) 



In fact, the ideal decoupling by diagonalization is not optimal for the integral 
of the squared error criterion and other non-diagonal factorizations can be 
preferable. 
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Discrete-Time 
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Chapter 9 



Discrete-Time Generalities 
and Basic Signal Processing 



The use of computer or, more generally, discrete time implies large differences 
in the way of approaching the process control problem compared to continuous 
time. As a matter of fact, a non- negligible part of the study will have to be 
devoted to the realization of a measurement interface, allowing us to express the 
analog signals delivered by the sensors in a digital form adapted to calculation 
(analog to digital conversion: A/D), as well as to express in analog form the 
digital inputs delivered by the computer (digital to analog conversion: D/A). 
The computer always contains a supervising system, which is essential, and 
allows the operator to control or intervene in the process. 

When time is continuous and when the signal is continuous with regard 
to its amplitude, the signal is analog, such as the raw signal delivered by a 
physical sensor. Due to the data acquisition system, measurements are realized 
at regular time intervals. Time is no more treated as a continuous variable, 
but is discretized; for this reason, the corresponding time-dependent signals 
constitute variables. A discrete-time signal with a continuous amplitude is 
said to be sampled. Moreover, a consequence of use of computer is that the 
variable obtained by this means cannot take any value, as it is digitalized; 
between adjacent amplitudes there exists a finite nonzero interval related to 
the quantization (8 bits, 12 bits or 16 bits, for example), and this is called 
quantization of the variable. A digital signal is constituted by a quantized 
variable in discrete time. 

Digital signals need the use of the z-transform, which plays the role of the 
Laplace transform for continuous signals. 
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9.1 Fourier Transformation and Signal 
Processing 

The important use of Fourier transformation in signal processing justifies a 
short review at this stage. 



9.1.1 Continuous Fourier Transform 

The Fourier transform of a continuous function f{t) is defined by 

/ -|-oo 

f{t)exp{-j2Tn^t)dt (9.1) 

-OO 

while the inverse Fourier transform is defined by 

/ -|-oo 

f{i^) exp{j2Tu^t)di^ (9.2) 

-OO 

Notice that passing from the Fourier transform to the Laplace transform is 
simply performed by changing j27ru = jco into s in the integral term. 

The main properties of the Fourier transformation are the following: 
Linearity: 

The Fourier transformation is a linear application: 

T{af{t) + (3g{t)) = a'f{v) + (3g{v) (9.3) 



Transformation sin-cos of the function f{t): 

Any function f{t) can be decomposed into the sum of an even function e{t) 
and an odd function o{t) 



m = 


e{t) + 


o{t) 






even 


odd 




with 










and 


o{t) = 


The Fourier transform of / is 








nOO 




pOO 




f{iy) = 2 / e{t) cos{27uyt)dt — 

Jo 


i2 

Jo 


o{t) sm{27rut)dt 



= d^cos{f{t)) 



(9.4) 



(9.5) 



Thus, the Fourier transform of function f{t) is decomposed into a sum of two 
terms, one being the cosinus transform of the even part and the other being 
the sinus transform of the odd part. 

Consequences: 
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m 


Fourier 


/(O 






transform 






even 


— > 


even 




odd 


— > 


odd 




real 

imaginary 


— ^ 


hermitian (i.e.: f{u) 
skew-hermitian (i.e.: 


= fi-V) 

/>) = -/(-O) 


Transposition: 




Hfi-t)) = fi-V 


(9.6) 


Conjugate: 




HJit)) = fi-V 


(9.7) 


Scale change: 




Hfiat)) = 


(9.8) 



A compression of the timescale t induces a dilatation of the frequency scale 
(and reciprocally), due to the duality time- frequency. 

Time translation: 

T{f(t -a)) = ey:p{-j2Trua)f{u) (9.9) 

The Fourier transform of the time-shifted function has its module unchanged, 
only its argument is modihed. 

Frequency modulation: 



J^{exp{j2Tuyot) f{t)) = f{v - i^o) (9.10) 

Modulating a function f{t) by an imaginary exponential amounts to translating 
its Fourier transform. 

Derivation with respect to variable t: 

Assume that f{t) is integrable, differentiable, with integrable derivative. 

= j27Ti^-^(/W) = J2t:v f{v) (9.11) 



and by iteration 



it)) = {j 27 Tumm) = {j27Turf{u) ( 9 . 12 ) 

The following inequality results 

I \f^^Ht)\dt>\{27rur\\f{u)\ (9.13) 

The more differentiable / is, with integrable derivatives, the faster / decreases 
towards infinity. 

Derivation with respect to frequency u: 



-J-JiV = •^((-J27rt)/(t)) 



(9.14) 




316 



Chapter 9. Discrete- Time Generalities and Basic Signal Processing 



the previous derivation is justified if tf{t) is integrable (thus f{t) decreases 
faster than 1/t^). 

More generally 

/(-)(;.) =^((-j27Tt)-/(t)) (9.15) 

resulting in 

!/(-)(;.) I < J \f{t)\\ 27 Ttrdt (9.16) 

the more f{t) decreases towards infinity, the more differentiable f{u) is (with 
bounded derivatives). 

Fourier transform of the eonvolution produet: 

the Fourier transform of the convolution product (denoted by *) of two func- 
tions is equal to the product of the Fourier transforms of both functions 

nm * g{t)) = nm) ( 9 . 17 ) 

Relation of Parseval-Planeherel: 

/ +00 ^+00 

\f{t)\^dt= \fU)\'^di' (9.18) 

-00 J — 00 

It expresses that the total energy of signal f{t) (represented by the left member) 
is equal to the sum of the energies of the signal components (right member). 



Energy Properties 

Total finite energy signals: 

The functions f{t) of the integrable square |/(t)P play an important role in 
physics as, when t is the time variable, |/(t)P represents, in general, an energy 
per time unit (power). The integral 




(9.19) 



represents the total dissipated energy. In physics, the integrable-square func- 
tions are, in general, called total finite energy functions. 

The mean quadratic deviation is defined by 



i:::e\m\^dt 



(9.20) 



and represents an image of the concentration of the energy in the neighbourhood 
of 0. 

The uncertainty relation (Roddier, 1971) 



V<t‘^> V < > > 



1 

47T 



(9.21) 
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means that if f{u) is searched for a given frequency z/q, then one must seek 
in the past history and the future of the signal f{t) for what corresponds to 
z^o; this corresponds to an inhnitely selective hltering which is impossible: f{u) 
cannot be perfectly known. Reciprocally, to know f{t) from ^(z^), an inhnite 
bandwidth is necessary. 

By setting the Fourier transform of f{t) equal to 



/(«/) = A{iy) exp(ja(j/)) (9.22) 

the module A{u) is positive real and the argument a{u) is also real. Thus, 

|/>)|2 = a2(^) = $0(^) (9.23) 

The energy spectral density ^j(z^) (also called energy spectrum) does not de- 
pend on the signal phase spectrum, thus is not modihed by a translation of the 
signal on the time axis. 

The autocorrelation function of the hnite energy signal f{t) is dehned by 

/ + 00 

fit) fit - Odt (9.24) 

-OO 



where f{x) represents the conjugate complex function of f{x). The Fourier 
transform of the autocorrelation function of the hnite energy signal f(t) is 
the energy spectral density ^j(z^). The autocorrelation function of a periodic 
function of period T is also a periodic function of period T. The autocorrelation 
function can be normalized 



r(^) 




(9.25) 



F((f) is called the coherence or degree of self-coherence: 0 < F((f) < 1. 

The cross-correlation function of the hnite energy signals f{t) and g{t) is 
dehned by 




fit) git - Odt 



(9.26) 



The Fourier transform of the cross-correlation function of the hnite energy 
signals f(t) and g(t) is the cross-spectral density ^j^(z/). The cross-correlation 
function can also be normalized 







(9.27) 



the function F fg{^) is the coherence degree between / and g. 

Finite mean power signals: 

Some signals such as f{t) = a cos(cjt) do not have a hnite energy. In this case, 
the mean power transported by the signal is considered 

T 



liniT^oo 



(9.28) 
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The previous properties are extended to finite mean power signals, e.g. the 
autocorrelation 

T 

= liniT^oo ^ J ^ ~ 

Periodical signals: 

Let f{t) be a periodic function of period T. In a general manner, it can be 
shown that f{t) can be expanded as a series of orthogonal^ functions in the 
considered interval PT] 

n=-\-oo ^ 

f{t) = ^ c„exp(j27rn-) (9.31) 

n=— oo 

where the coefficients Cn are equal to 

1 ^ 

Cn = — J fit) exp(-j27rn-)(it (9.32) 

When the function f{t) is real, the Fourier series expansion is written as 



./ . ao .27t A . . 

/(t) = y+ ^ f anCos(— nt) + 6nSm(— nt) j (9.33) 

n=l ^ ^ 



or 



n=-|-oo 



/w = T+ E (oLn COs(27TZ/o^O + sin(27TZ/o^O) (9.34) 

^ n=l 

by introducing uq = 1/T. 

The coefficients and bn can be calculated according to the following 
relations 

/ X 

f{t) cos{27nyont)dt 



bn = T /(O sin(27TZ/o^O^^ 



(9.35) 



Set: /n = /(^^o) = ^ (^n — jbn)‘ If can be noticed that the coefficient 
previously defined is equal to f{nuo). The Fourier transform f{u) of function 
f{t) is then equal to 



n=+oo 

fi^)= E finuo)5{u -nuo) (9.36) 

n=— OO 

^The functions f{t) and g{t) are defined as orthogonal in the interval [a, 5] if their scalar 
product is zero 

<f,9>= [ w{t) f{t)g{t)dt = 0 (9.30) 

J a 

where w{t) is a weight function [a, 5] over a particular interval, both depending on the con- 
sidered type of function. Examples of orthogonal functions are Walsh functions (periodic, 
binary (= — 1 and +1), called square waves in electronics), Legendre polynomials, Laguerre 
polynomials, Chebyshev polynomials and Hermite polynomials. 




Process Control 



319 



/(nz/o) is the frequency spectrum, which is decomposed into the amplitude 
spectrum 

|/(nt'o)| = (9.37) 

which is an even function, and its phase spectrum 

(t){nuo) = arctg (9.38) 

V / 



which is an odd function. 

The spectrum of a periodic function of period T is a discrete spectrum, thus 
a discrete function whose minimum interval on the frequency axis is: uq = 1/T; 
it is discontinuous (it exists only for multiples of the fundamental frequency uq). 
Non periodic signals: 

The non-periodicity of the studied function can be considered as the fact that 
when T ^ (X), then z/q ^ 0, the spectrum becomes a continuous function. The 
function f{t) is deduced in a form analogous to that given for periodic functions 

/ +00 r-\-oo 

exp(j27TZ/t)dz/ / f{u) exp(— j27rz/n)dn (9.39) 

-oo J — oo 



and the Fourier transform 




f{t) exp{—j27rut)dt 



(9.40) 



Fourier transform of signals known on a limited interval: 

Here, only physical signals are concerned. As they are known in a limited 
interval, these signals have necessarily a hnite energy. 

Two cases are often met: 

a) The function represented in known interval [0,T] as the considered signal 
is assumed to be zero outside, thus its Fourier transform f{u) is a continuous 
function with respect to u. 

b) The function represented on known interval [0,T] as the considered signal 
is assumed periodic with period T. Its Fourier transform f{u) is a discrete 
spectrum with rays An = 1 /T, each ray being equal to 



fn = fiV with: = n/T (9-41) 

The larger the period T, the larger the density of the discrete spectrum. The 
Fourier transform is equal to 



n=+oo n=+oo 

/»= ^ fnSU-^)= Y. 



(9.42) 
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9.1.2 Discrete Fourier Transform 

Obtaining the discrete Fourier transform (Fig. 9.1) can be realized (Brigham, 
1974) according to the following procedure 

1. First, consider the continuous signal f{t). 

2 . Sample this signal with a Dirac comb St^ of period Tg (this is a periodic 
train of Dirac or ideal impulses), of amplitude 1; the sampled function is 
thus obtained 

-|-oo 

fit) St, = finTs)S{t - nTs) (9.43) 

n=— oo 

Note that a bad choice of the sampling period can generate spectral aliasing. 

3. This function is truncated by multiplying by a rectangular pulse function 
g{t) of length amplitude 1 , not centred in 0 , not starting at 0 , but at 
—Tg/2 (this allows us to keep the signal at 0). N values are obtained such 
that 

m St, g{t) = E(^r-oo fi^Ts) S{t - nTs) g{t) 

= ENo finT,)Sit-nT,) 

with N = DjTg. We could consider that this sampled set of N 
represents a pattern of a periodic function defined in [— oo, Too]. 

4. The Fourier transform of this sampled set of N values is equal to 

/(^) = ^[ENo fi^T,) Sit - nTs)] 

= ENo finTs)mt-nTs)] 

= Y)n=o fi'^Ts) exp(-j27T!^nTs) 

The previous Fourier transform f{u) was a continuous function with respect to 
frequency z/, periodic (with its period equal to Ug = I/F 5 ), completely defined 
by the N values of the series (9.45). The time-truncation by the rectangu- 
lar pulse function causes frequency oscillations, called the Gibbs phenomenon 
(Kwakernaak and Sivan, 1991). This continuous Fourier transform is itself 
sampled at frequencies Uk = kAu (with Au = 1 /(A^T 5 )), called the harmonic 
frequencies of the discrete Fourier transform. The discrete Fourier transform 
(DFT) is defined (Kunt, 1981) as the series 



(9.44) 

values 



(9.45) 



A^-l 

/(^) = N expi-j2TTkn/N) (9.46) 

n=0 

where /(n) represents the value of function f{t) at time nTg. f{k) thus is an 
element of a periodic series of N elements defined in a frequency bandwidth 
Ug = 1/Tg and separated by the frequency increment Au = 1 /(A^T 5 ). 

The inverse discrete Fourier transform is defined as the series 
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Frequently, the sampling period is normalized: T 5 = 1, resulting in slightly 
simplihed formulae. 

With the series /(n) being periodic, it is possible to make it start at any 
instant, which gives 



no+A /'— 1 

/(^) = f{n) exp{-j2TTkn/N) (9.48) 

n=no 

In general, the time origin is chosen as the hrst instant, giving formula (9.46). 

Similarly, as the series f{k) is periodic, it is possible to make it start at any 
frequency. Often, the main period of the complex spectrum is chosen as the 
interval [— A^/2, A^/2 — 1], hence the inverse discrete Fourier transform 

1 

/(n) = — f{k) exp{j2Trnk/N) (9.49) 

k=-N/2 

The number of points N must be large and, in general, are taken as a power 
of 2 (e.g. N = 1024). In these conditions, the discrete Fourier transform is 
equivalent to the Fourier transform of the continuous signal. The fast Fourier 
transform (FFT) corresponds to a fast computing algorithm of the discrete 
Fourier transform. 

Some properties of the discrete Fourier transform are: 

• The discrete Fourier transform is a linear application. 

• The discrete Fourier transform is an unitary operation^, i.e. it keeps the 
scalar product 

</i,/ 2 >=</i,/ 2 > (9.50) 

This property induces the Parseval-Plancherel relation 

ll/f = WfT (9.51) 

• The signal energy (cf. Parseval-Plancherel relation) is equal to 

N-l N/2-1 

E !/>)!' (9.52) 

n=0 k=-N/2 

i.e. the sum of the energies of the signal frequency components i>k = 
k/{NTs). Each frequency contribution \f{k)\‘^ is sometimes called a pe- 
riodogram (Ljung, 1987), and constitutes an estimation of the spectral den- 
sity, however biased (Soderstrom and Stoica, 1989). 

• The discrete convolution product y of two signals u and g is dehned by 

A^-l 

y{n) = u{i)g{n — i) , n = 0, . . . , — 1 (9.53) 

^=0 

2 A linear transformation defined in a vector space is unitary if it keeps the scalar product 
and the norm. 
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• The discrete Fourier transform of the convolution product is equal to the 
product of the discrete Fourier transforms 

y(k) = u{k)g{k) (9.54) 

• The time translation m induces a phase rotation of the discrete transform 

fm{k) = /(n — m) exp(— j27r/cn/A^) = /(/c) exp(— j27r/cm/A^) (9.55) 

n=0 



• The discrete Fourier transform verifies 

fi-k) = J{k) (9.56) 



Remark : 

Kwakernaak and Sivan (1991) call the discrete Fourier transform previously 
defined “Discrete to discrete Fourier transform”’, to emphasize that the discrete 
signal /(n) is used to build the discrete signal f{k). This transform uses an 
expansion on an orthogonal basis. By using an expansion on an orthonormal 
basis with respect to the scalar product, they define, as Ljung (1987), the 
discrete Fourier transform as 

1 

f{k) = f{n) exp{-j2TTkn/N) , k = 0,...,N-l (9.57) 

and the inverse discrete Fourier transform 
1 

fin) = —^ W f{k) exp{j2Trnk/N) , n = 0, . . . , - 1 (9.58) 

V ^ k=-N/2 

Moreover, Kwakernaak and Sivan (1991) define the sampled discrete Fourier 
transform as 



(N-l)Ts 

f{'^)=Ts fit) exp{-j2TTut) , u = 0,...,iN -l)/iNTs) (9.59) 

t=0 

and the inverse sampled discrete Fourier transform as 

^ (7V-1)/(7VTD 

fit) = fii2) expij2TU2t) , t = 0, . . . , (AT - l)Ts (9.60) 

^ u=0 

These various transforms are, of course, also unitary. 

9.1.3 Stochastic Signals 

A signal is nearly always noisy. The noise is a stochastic (random) phenomenon 
which is superposed to the source signal of interest to the user. The noise and 
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the noisy signal constitute two stochastic signals. In continuous time, the signal 
is denoted by x{t). In discrete time, the signal is denoted by x(n), where n is 
the instant. With the stochastic signal x{n) is associated a distribution function 
/(x, n) equal to 

/(x, n) = V{x{n) < x) (9.61) 

where V{x{n) < x) = V{x^n) is the probability that the variable x{n) is lower 
than a given value x. The probability density p(x, n) is the derivative of the 
distribution function, dehned as 

= (M2) 

The main statistical variables characterizing the random signal x{n) are: 



• The hrst-order moment or set mean or mathematical expectation of the 
signal resulting from a large number of observations 



/ oo 2 ^ 

x{n)p{x^n)dx{n) = lim — / Xi{n)V{x^n) 

-oo ^ 

(9.63) 



i=l 



The time mean of the signal 



1 /.T ^ N 

^^x=Vlm - x{t)dt= lim (9.64) 

T^oo 1 /n N^oo iV ^ ' 

i=l 



The second-order moment 



/ oo 

x‘^{n)p{x,n)dx{n) (9.65) 

-oo 

• The second-order moment centred with respect to the mean, or variance 

/ oo 

[x{n) — px{n)]‘^p{x,n)dx (9.66) 

-oo 



When the stochastic signal x is considered at two different instants ni and n 2 , 
the distribution function of the stochastic variables x{ni) and x{ri 2 ) is defined 
by 

/(xi,X 2 ,ni,n 2 ) = V{x{ni) < xi]x{ri 2 ) < X 2 ) (9.67) 



where V{x{ni) < x\]x{n 2 ) < X 2 ) = V{x\^X 2 ) is the joint probability (or 
second-order probability) that the signal x at time ni is lower than a given 
value xi, and that x at 1 x 2 is lower than a given value X 2 - The joint probability 
density results 



p(xi,X2,ni,n2) 



a^/(xi,X2,ni,n2) 

dx\dx2 



(9.68) 



The associated statistical properties are: 
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• The autocorrelation function (denoted by or R) of the signal x{n) 

/ OO nOO 

/ xiX 2 p{xi, X 2 , ni, ri 2 )dxidx 2 

-OO j — OO 

(9.69) 

• The autocovariance function of the signal x(n) 

Cxx{ni,n2) = E[{x{ni) - fix{ni)}{x{n2) - Ma(«2)}] 

= - f^x{ni)][x2 - ^ix{n2)]p{xi,X2,ni,n2)dxidx2 

= Rxx{ni,n2) - Px{ni)px{n2) 

(9.70) 



9.1.4 Stochastic Stationary Signals 

A signal is stationary in the strict sense if all its statistical properties are time- 
independent. It is stationary in the wide sense if its first two moments are 
time-independent. The notations are then simplified: 

• The probability density, the mean, the variance are, respectively, p(x), /i^,, 

• Noting that r = t 2 —ti, the autocorrelation function and the autocovariance 
function are, respectively Rxx(^) and Cxx{^) = Rxx{^) ~ Px’ 

The autocorrelation function does not depend anymore on time t, but only on 
the time shift r. 

A signal is ergodic when the mean values (time averages) are equivalent to 
the corresponding expectations (set averages), thus for a moment of any order i 

E[x'^] = / x'^p{x)dx = x'^ = ^im — x^(n) (9-71) 

Thus, the autocorrelation function of a stationary and ergodic signal is equal 
to 



^ N-\n\-l 

RxxM = E[x(i)x(i-\-n)] = x(i)x(i + n) = — ^ x(i)x(i-\-n) (9.72) 

N — n 

' ' ^=0 

and the autocovariance function of a stationary and ergodic signal is equal to 

Cxx(n) = [x(i) - px] [x(i + n) - px] = Rxx(n) - pi (9.73) 

Recall that the power spectral density ^x of a signal x is the Fourier transform 
of the autocorrelation function of this signal 

OO 

^x(k) = Rxx(n)exp(-j2Trkn/N) 



n=— OO 



(9.74) 
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The cross-correlation function of the stationary and ergodic signals x and 
y is equal to 



^ N-\n\-l 

Rxy{n) = E[x{i)y{i + n)] = x{i)y{i + n) = — — — V x{i)y{i + n) (9.75) 

i\ — n 

' ' ^=0 

The two signals are not correlated if the cross-correlation is always zero. The 
cross-spectral density ^xy of fho signals x and y is the Fourier transform of the 
cross-correlation function of these signals 

oo 

^xy{k) = Rxy{n)exp{-j2TTkn/N) (9.76) 

n=— OO 

The cross-covariance function of the stationary and ergodic signals x and y 
is equal to 

Cxy{n) = E[{x{i) - Px}{y{i + n) - py}] = Rxy{n) - px^y (9.77) 

9.1.5 Summary 

Consider a variable such as signal energy. The expression of this variable differs 
according to the type of signal that is considered and whether time is continuous 
or discrete. Table 9.1 resumes the different cases that can be encountered. It 
is possible to have the same reasoning for other quadratic variables such as 
power, scalar product, correlation (see Table 9.2), covariance, spectral density, 
etc. 
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9.2 Sampling 

9.2.1 D/A and A/D Conversions 

Suppose that the signal delivered by the sensor is analog. This signal is not 
usable by the computer and must be transformed: using a clock, a given num- 
ber of signal values are withdrawn, thus the analog signal is sampled with a 
sampling period At (Fig. 9.2) or a sampling frequency 1/At. The signal thus 
obtained consists of a sequence of discrete values separated by a time interval 
At (impulse modulation). The sampling operation induces a loss of informa- 
tion: if the sampling is regular, no information exists between two successive 
equidistant instants. A multiplexer allows the acquisition of several signals 
coming from different sensors, of which only a small proportion will really be 
used for control, the remaining being used for monitoring. Note that, in prac- 
tice, the sampling is frequently realized either with different sampling periods 
according to the signals, or with time- varying sampling periods. 




Figure 9.2: Sampling of a signal 

Conversely, if the original signal is a digital signal delivered by the computer 
and is to be transformed into an analog signal, a holder is used, e.g. a zero- 
order holder, also called a boxcar which keeps the signal value constant during 
the period At 

yb{t) = Vn-l = y{tn-l) for tn -1 < t < tn (9.78) 

The digital signal (Fig. 9.2), consisting of a series of impulses is thus discrete; 
it becomes continuous after the holder (signal reconstruction), and consists of 
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a series of steps. 

Other types of holder can be used, such as the first-order holder, which 
linearly extrapolates the signal in the period by using the increase 

during the previous period 

yb(t) = Vn-l + - yn-2) for tn-l < t < tn (9.79) 

In fact, the more frequently used holder in process control remains the zero- 
order holder and we will assume that we always use this type. The general 
representation of the digital control loop is given by Fig. 9.3. 



Computer 




Figure 9.3: General representation of the digital control 



9.2.2 Choice of Sampling Period 

Consider the case of a sinusoidal signal (Fig. 9.4): we notice that if the sampling 
period is equal, for example, to the period of the sinusoid itself, the sampled 
signal contains information which will all have the same value, as if it were 
coming from a flat constant signal. This phenomenon is due to the sampling 
period being too large with respect to the frequency band of interest for the 
observed signal. 




Figure 9.4: Sampled sinusoidal signal with an ill-chosen sampling period 

A sampler is obtained (Fig. 9.5) by multiplying the input signal y(t) by a 
sampling function s(t), which is a periodic sequence (period Tg = 1/i^s) of 
rectangular impulses of duration D. The sampled signal is thus equal to 



ys{t) = y{t)s{t) 



(9.80) 
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Imagine that the impulse duration D is inhnitely short; the sampling func- 
tion s{t) then consists of a periodic train of ideal impulses (period T^) denoted 
by 

n=+oo 

s{t) = 5ts = 5{t — nTg) (9.81) 

n=— oo 



s{t) 



y{t) / 



ys{t) 



Figure 9.5: Sampling of a signal by a periodic train of ideal impulses 
The ideally sampled signal ys{t) can be represented by a series 



n=+oo 

ys{t) = XI y{nTs)S{t - nTs) (9.82) 

n=— OO 

The Fourier transform of the sampled function yg (t) is equal to the convolu- 
tion product of the Fourier transform of the signal y{t) by the Fourier transform 
of the sampling function s{t) 



ysii^) = yii^) * s{iy) (9.83) 

The Fourier transform of the periodic train of ideal impulses is equal to 

(9.84) 

which is itself a periodic train of ideal impulses, of period z/ 5 . The Fourier 
transform of the sampled function yg (t) results 

ys(i^) = y{v)*vs5i,X'^) 

^ (9.85) 

E n=+oo \ ' ^ 

n=-oo - nVs) 

This function ^sy{^ ~ is a periodic function of period Ug and 

amplitude Ug\y{u)\. The spectral density 4>^^(z/) (Fig. 9.6) is equal to 

n=+oo 

E (9.86) 

n=— 00 

Thus, it has the same period as the Fourier transform ys{i^)- The functions 
ys{i^) and 4>^^(z/) are, respectively, the repetition of the motives of Fourier 
transform y{u) and spectral density 4>^(z/), respectively multiplied by Ug and 
z/g . The frequency z^max of Fig. 9.6 corresponds to the bandwidth of the process 
i.e. the frequency above which the amplitude of the spectral density becomes 
negligible. 
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V 




Figure 9.6: Spectral densities for the studied signal and the corresponding 
ideally sampled signal 



This case was an ideal view; in reality, the impulse has a finite duration and 
the value obtained during duration D is held, or rather, an average is realized 
in this time interval. Thus, the first value obtained at t = 0 will be kept during 
D and the first rectangular impulse is centred at D/2] the following impulse 
starts after a period Tg and has the same duration and so on (Fig. 9.7). 




T " D 



Figure 9.7: Real sampling of a signal 

In a similar manner to the ideal case, the Fourier transform of the sampled 
function ys{t) is to be calculated. The Fourier transform of a rectangular 
impulse function of duration centred at to (here: D/2) is 

jF[rect( ^ j^^ )] = Dsmc{iyD) exp{—j27nyto) (9.87) 



where sine represents the cardinal sinus function equal to sinc(x) = sin(7rx)/(7rx). 
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The Fourier transform of the sampled signal is 

ys(0 = y(0 * '^s-C»sinc(nr»!^s) exp(-j7r!^r»)] 

_ (9.88) 

= J2n=-'^ VsDsmc{nDvs)y{v - nvs) eyip{-j'KvD) 

We thus notice that the Fourier transform of the sampled signal depends on 
the sampling frequency i>s] h is no more a periodic function, but is attenu- 
ated by the presence of the sinus cardinal term. The exponential term is the 
consequence of the delay effect introduced by D/2. 

The spectral density of the sampled signal is not limited with respect 

to frequency, but extends in the whole spectrum (Fig. 9.8). 





Figure 9.8: Spectral densities for the studied signal and the corresponding 
sampled signal 

The original spectrum y{u) of the analog signal is limited at the maximum 
frequency z^max- The spectrum of the sampled signal ys{i^) is a function (term: 
z/ 5 lysine (nI^z/ 5 ) exp (—ZTTZ/I^)) of the periodic repetition (term: “ 

rws) ) of fho original spectrum y{i>) of the analog signal. If the term of the func- 
tion were equal to 1 , then the spectrum would be strictly periodic. According 
to the value of the maximum frequency i^rnax corresponding to the extent of the 
spectrum of the analog signal, the sequences of the sampled signal spectrum 
either partly overlap themselves (Fig. 9.9) or do not overlap. If overlapping, 
also called aliasing, occurs, then the transformation is non-reversible, thus the 
reconstruction is not possible, which means a loss of information. 

We assumed that the original spectrum of the analog signal was not fre- 
quency-limited; if that were not the case, frequency aliasing would occur for 
any sampling, even with ideal sampling. To avoid this problem, in practice, a 
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Figure 9.9: Frequency overlapping of the spectral densities: aliasing 



convenient filter such as a high-order Butterworth filter is placed upwards with 
respect to the sampler in order to eliminate the higher frequencies. 

A physically realizable signal (finite-energy signal) necessarily has an infinite 
extent spectrum. Thus, this would pose a problem concerning the sampling; 
but, so that the energy is finite, the spectrum must tend towards 0 when the 
frequency tends towards infinity. Thus, the spectrum will be nearly zero 
above some frequency; to reconstruct the signal as close as possible to y{t)^ 
the high-frequency components of ys{i^) must be eliminated by a lowpass filter. 
The aliasing will be negligible provided that the sampling frequency is well 
chosen as the Shannon theorem indicates. 

Shannon sampling theorem: 

The Shannon theorem provides rules which allow us to guarantee minimal in- 
formation losses caused by sampling: the analog signal y{t)^ having a lowpass 
spectrum (band-limited signal) (maximum frequency z^max), is totally described 
by the sequence of instantaneous values y{tk) periodically sampled provided 
that the sampling period Tg is lower than or equal to l/(2i>rnax) 



T. < 



2Ury 



or Ua > 2Urr 



(9.89) 



In practice, the sampling frequency Ug must be largely greater than nmaxi 



e.g. 

Vs > ^Vmax (9.90) 

In the case of an ideal sampler, the ideal lowpass filter would be a filter 
whose amplitude of the harmonic response is the rectangular function 



\G(y)\ = Tsrect(— ) 



(9.91) 



When the spectrum of the signal to be sampled has a wide band due to the 
environment noise (sensors, electronics, etc.), a prefilter is placed before the 
sampler; this is called an anti-aliasing filter (Fig. 9.10), the function of which 
is to filter any frequency higher than z^s/2. Ideally, it would be a lowpass 
filter limited to a passband B (theoretically equal to the frequency nmax)i 
indeed practically extending up to . The difference Bj^ — B constitutes the 
transition band. Max (1985) recommends as an anti-aliasing filter the elliptic 
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Cauer filter, of order n = 2m, composed of m elementary filters in series, each 
having as a transfer function 



G(.) = (9.92) 

' + -^'2^+ (2^ 

where uq and ui are characteristic frequencies. The Butterworth hlter (Mi- 
tra and Kaiser, 1993) is frequently cited, but its transition from bandpass to 
bandstop is less stiff. 



s(t) 



y(t) 



51 W 



51 W 




92(t) 



ys(t) 



|Gi(OI 



1 




B B, 






Figure 9.10: Anti-aliasing hltering 

The sampling theorem cannot be applied without looking at the type of signal 
spectrum; that theorem was valid only for a lowpass signal; for a bandpass 
signal, the theorem can be expressed slightly differently. Suppose that the 
signal spectrum is bounded by low frequency I'min and by high frequency i^rnax ] 
the bandwidth B is equal to B = Umax ~ ^min- Let m be the largest integer 
that is lower than or equal to Umax! B. 

To guarantee non-spectral overlapping, the sampling frequency must be 
larger than or equal to 2vmax!^ 

Us > (9.93) 

m 

It is possible to notice that if the higher frequency Umax is much larger than 
the bandwidth 5, the minimum sampling frequency becomes close to 2B 

If ^max > B Us > 2B (9.94) 

Now we will discuss the practical choice of sampling period with respect to 
the real process characteristics: 
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• Too slow a sampling will have the drawback of reducing the efficiency of the 
feedback with respect to disturbances. 

• Too fast a sampling will needlessly overload the computer memory. More- 
over, when the signal to noise ratio (ratio of the signal variance over the 
variance of a random input disturbance or of a noise) is low, oversampling 
provokes a large influence of the noise: then it is necessary to filter. 



Table 9.3: Recommendations for the choice of sampling period 



Physical system 


Sampling period 


or physical variable 


Tg recommended 


Electrical engines 


Ts < 0.1 s 


Flow rate 


1 s 


Level 


5 s 


Pressure 


5 s 


Temperature 


20 s 


Open-loop system: 


Recommended 


characteristic variable 


sampling period 


First-order with time constant r 


0.25r < Tg < T 


First-order with time delay td 


0.2td <Tg < td 


Second-order of natural frequency 


0.05/cJn <Tg < l/cJri 


^max- dominant time constant 


Tg <C O.lTmax 


Settling time tg 


tg/lb <Tg< tg/6 


Critical frequency cOc 


0.15/cJc < Tg < 0.5/cJc 


Closed-loop system: 


Recommended 


Characteristic variable 


sampling period 


Integral time constant r/ 


Ts > Tj/100 


Derivative time constant Td 


O.lTd <Ts < 0.5rd 



Table 9.3 (Flaus, 1994; Seborg et ah, 1989), gives some indications concerning 
the recommended sampling frequencies which are valid for a large number of 
chemical processes. However, these values may differ according to the pro- 
cess, especially concerning the cited physical variables, where the user will be 
essentially guided by the process time constants. 

9.3 Filtering 

The noise which affects analog signals can come from different sources: the 
measurement device, the electrical environment, the process, etc. The electrical 
noise can be minimized by shielding the cables, but the noises coming from the 
measurement and process must be minimized by filtering (Fig. 9.11), which 




Process Control 



337 



transforms a noisy input signal y(t) into a filtered signal yf(t) coming out of 
the filter. A basic analog filtering is recommended, i.e. before data acquisition, 
where the signal can be later digitally hltered. 



yit) 



Filter 



yf{t) 



Figure 9.11: Filtering 

Recall that the choice of sampling period is essential; to avoid the problems 
related to a too-low sampling rate, data signals are prehltered by means of 
an anti-aliasing hlter which will be used systematically in order to hlter high- 
frequency noise. 

It is not possible to present all existing hlters, but rather to show some 
simple principles concerning elementary hlters; for more complex hlters (But- 
terworth, Chebyshev, Bessel, Cauer, . . . , adaptive hlters), the reader can refer 
to the following texts: Crochiere and Rabiner (1983); Kunt (1981); Mitra and 
Kaiser (1993); Proakis and Manolakis (1996); Rorabaugh (1997); Salman and 
Solotareff (1982). In particular, the transformation formulae from a given ana- 
log hlter into a digital hlter of specihed bandwidth will be found (lowpass, 
bandpass, bandstop, highpass, notch). Bellanger (1989); Haykin (1991); Tre- 
ichler et al. (1987); Vaidyanathan (1993) treat adaptive hlters. 

9.3.1 First-Order Filter 

A hrst-order exponential hlter will make a noisy measurement signal smoother. 
The differential equation corresponding to the hrst-order transfer function for 
the hrst-order analog hlter is 



+ ( 9 - 95 ) 

This hlter is represented by a continuous hrst-order transfer function equal to 

Gf{s) = FM = 1 (9.96) 

where r/ is the hlter time constant and the hlter gain is equal to 1. This 
hlter dampens the high-frequency huctuations: it is a lowpass hlter currently 
called RC hlter because of its electrical scheme. The hlter time constant r/ 
must be far smaller than the dominant time constant of the process Tmax to 
avoid a lag in the feedback loop (e.g. r/ < O.lrmax)* If the noise amplitude is 
large, r/ must be increased. On the other hand, the noise bandwidth must be 
considered: let be the lowest frequency of the noise, r/ must be chosen so 
that i^f < lyrn where i^f = 1 / t / (the hlter dampens signals of frequency higher 
than z/j, i.e. the high-frequency noise). By dehning z^max = 1/^max, it results 
that for the hlter 



^max <Vf <Vm 



(9.97) 




338 



Chapter 9. Discrete- Time Generalities and Basic Signal Processing 




Figure 9.12: First-order filter 

It is possible to consider a digital first-order filter associated with Eq. (9.95), 
for example, by discretizing this equation using a backward finite difference 
scheme. Denote by subscript n the values at time t, by y{n) the filter input and 
by yf{n) the filtered output (Fig. 9.12). The discretized differential equation 
becomes 

^^ yf{n) Mn — ])_ ^ ^ 

which can be ordered as 

yf{n) = ay{n) + (1 — a)yf{n — 1) with: ^/(O) = ^(0) (9.99) 

where 

“ = Tf/At + l (0 < “ ^ 1 ) ( 9 - 100 ) 

The limit a = 1 corresponds to no filtering (zero time constant). The limit 
ce = 0 does not take into account the measurement. 

9.3.2 Second-Order Filter 

The second-order filter is equivalent to two first-order filters in series. Let 
yf 2 {^) be the signal coming out from the second filter and yfi{n) from the first 
filter (Fig. 9.13). 




Figure 9.13: Two first-order filters in series 

The equations for both filters are 

y/i(n) = aiy{n) + (1 - o;i)y/i(n - 1) (9.101) 

yf2{n) = a2yfi{n) + (1 - a2)yf2{n ~ 1) (9.102) 

giving the equation for the set of filters 

yf2{n) = aia2y{n) + {2-ai-a2)yf2{n-l)-{l-ai){l-a2)yf2i.n-2) (9.103) 

which corresponds to a second-order filter. The second-order filter is a better 
filter of high-frequency noise than the first-order filter. 
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9.3.3 Moving Average Filter 

The moving average or smoothing hlter realizes the average of the k last points 
with a hxed weight 

1 "" 

Vfin) = ^ y(i) (9.104) 

i=n — k-\-l 

giving the recursive equation 

y/(n) =y/(n-l) + i[y(n) -y(n-fc)] with: y/(0) = y(0) (9.105) 

In this form, this hlter would present a bias due to the initial value. For this 
reason, it is, in general, used with a forgetting factor (ce < 1 and close to 1) 

yf{n)=ayf{n-l) + ^[y{n)-y{n-k)] with: y/(0) = y(0) (9.106) 

This hlter which is a lowpass hlter such as the exponential hlter is, in general, 
less efficient than the exponential hlter, which gives more weight to the last 
measurements. The smoothing hlter can be calculated by means of least squares 
(Mitra and Kaiser, 1993). 

9.3.4 Fast Transient Filter 




Figure 9.14: Signal presenting transient spikes 

The noise can sometimes show extremely sudden transients (spikes) (Fig. 9.14), 
which superpose on a signal which would be smooth in their absence; in general, 
they are caused by the electrical environment of the sensor. They can be 
partially eliminated by allowing a maximum threshold of the signal variation 
between two successive instants. 

Another way to proceed is to consider the m last measurements y{i), and 
to order these measurements by increasing amplitude. Then, the lowest 
points and the highest rah points are rejected; these points are later replaced 
in the signal by immediately lower or higher points. 

Example 9.1: Filtering of Noisy Measurements 

Some measurements in a wastewater treatment plant are obtained in a difficult 
environment and are noisy (Fig. 9.15). These data have been hltered with a 
simple hlter realized as follows. First, a hrst-order continuous hlter with unity 
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gain and an adequate time constant has been created and then discretized with 
Tustin transformation. The choice of time constant has important consequences 
on the aspect of the filtered data (Fig. 9.15). The time constant was chosen 
such that the dynamics were not too damped. 



Raw data 





Figure 9.15: Raw data (top) from a wastewater treatment plant and filtered 
data (bottom) 



9.4 Discrete-Time and Finite-Differences Mo- 
dels 

By its nature, a computer cannot treat analog signals, and only sampled sys- 
tems are considered. Time is discretized or discrete. Consider the continuous 
differential first-order equation 



^ = /(a;,y) (9.107) 

To numerically integrate this differential equation, a large variety of schemes 
are possible (Carnahan et ah, 1969); choose the simplest one: the explicit Euler 
scheme, which consists of approximating the derivative by the backward finite 
difference 



Vn - Vn-l _ ^ 

At dt 



(9.108) 
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The equation to be integrated becomes 



Vn = Vn-l + /(a^n, Vn) At (9.109) 

With the implicit forward hnite difference to approximate the derivative 

1/n+l “ Vn 



At 



dt 



(9.110) 



a different equation would have been obtained 



Vn = Vn+l - f{Xn, Vn) At (9.111) 

It shows that, to a unique continuous-time differential equation, correspond 
as many discrete differential equations as different numerical schemes exist 
for approximating a derivative. As this was emphasized during the statement 
concerning sampling, there is no bijection in the change from continuous to 
discrete time. 

The ideal impulse (Dirac impulse) occurring at time nTg {Tg corresponding 
to the sampling period) is symbolized by S{t — nTs). By use of simplified writing 
(avoiding the rigorous mathematical use of distributions), the sampled signal 
is described at this time as 



y^{nTs) = y{nTs)5{t - nTg) (9.112) 

The sampler is qualified as an ideal impulse. 

The whole signal between 0 and nTg, sampled by a periodic train of ideal 
impulses, can be described by 

y*{t) = y*(0)+y*(T,) + ... + y*(nT,) 

= y{t))5[t) ^ y{Ts)5{t — Tg) ^ ^ y{nTs)5{t — nTg) (9.113) 

= Etoyi^Tg)s{t-tTg) 

The values y{iTg) correspond to constant coefficients and the Laplace transform 
of the sampled signal is easily deduced 

n 

F*(s) =J2yiiTs)exp{-iTss) (9.114) 

^=0 

This point of view is ideal, as an impulse always has a finite length and, in this 
case, the Laplace transform is more complex. 

Most often, a zero-order holder is used, which keeps the signal value between 
two sampling instants constant 

y{t)=y{kTg) if: kTg < t < {k ^ l)Tg (9.115) 

This is referred to as the signal reconstruction. According to the type of holder 
used, the continuous signal thus obtained would be different. 
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9.5 Different Discrete Representations of a 
System 

The Laplace transformation was a practical way for treating the systems de- 
scribed by continuous-time models; the ^-transformation will be the method 
for discrete-time systems. 

In order to avoid confusion, at this level we will introduce the notation F{s) 
for the Laplace transform of f{t) so as to distinguish between the function F{s) 
and the z-transform denoted by F{z) which does not have the same analytical 
form. 

9.5.1 Discrete Representation: z-transform 



u{t) ^ 


^ u*{t) 


Zero-order 


v{t) 


Process 


y(t) 




Holder 









y*(t) 



Figure 9.16: Digital control of a process 

The process, being linear or linearized around its operating point, is character- 
ized by its continuous transfer function 

m = Sg (9.116) 

To perform the digital control, before the process input, a sampler is placed 
so that the sampler output is and a zero-order holder transforms the 

discrete signal into a continuous signal v{t) (Fig. 9.16). At the process 

output, the continuous signal y{t) is itself sampled into ^*(t) to be observed 
with a sampling period Tg. 

Consider any sampled signal, e.g. corresponding to the output y{t) at 

sampling times nTg . The sampled signal (t) can be represented in the form 
of a distribution product 



y*{t) = < y{t),ST,{t) > (9.117) 

where St^ (t) is the periodic train of ideal impulses equal to 

oo 

ST^{t) = J2S{t- nTs) (9.118) 

n=0 

Thus, (t) can be considered as a continuous function, which is zero at every 
point except at sampling times nTg where it can be represented by an impulse 
of amplitude y{nTs). The Laplace transform of the sampled signal ^*(t) results 

oo 

^*(s) = y;y(nTs)exp(-nTss) 

n=0 



(9.119) 
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Setting 

z = exp(T55) (9.120) 

we introduce the z-transform of the signal y(t) 

oo 

Z{y{t)) = Y{z) = F*(s) = ^ y(nT,)z-” (9.121) 

n=0 

Sometimes, the z-transform is dehned as a Laurent series^ 

+ 00 

Y{z)= Y. y(nT,)z-” (9.123) 

n=— OO 

A very important characteristic must be noted: the z-transform depends 
on the value of the sampling period Tg] with a given continuous signal y{t)^ 
knowing T 5 , we will associate a unique z-transform; on the contrary, knowing 
the z- transform of a signal, there exist an inhnite number of continuous signals 
that have the same z-transform. The information contained between the sam- 
pling times is lost. The sampled output ^*(t) corresponds in a unique manner 
to the hxed sampled input u^(t). 



Different presentation (Sevely, 1969): 

The sampled function ^*(t) is equal to Eq. (9.117) and is simply denoted 
by 

y*it) =y{t)5T,{t) (9.124) 

where St^ (t) is the periodic train of ideal impulses. 

Set T(s), the Laplace transform of function y{t). Similarly, At^{s) is the 
Laplace transform of function STs(t), which is equal to 

At, ( 5 ) = C[ST,{t)] = 1 + exp(-5T5) + exp(-25T5) + . . . (9.125) 



Provided that | exp(— sTg)! < 1 , i.e. the real part of s is positive, it is possible 
to write 

AtAs) = C[5TAt)] = i_exp\_,T,) 

According to the convolution theorem (operator *), the Laplace transform 
of the sampled function (t) results 



y (s) =C[y*{t)]=C[y{t)]*C[5TAt)] ^ 

1 pVR-\-joo 

T*( 5 ) =- — : Y{v) At^{s — v) dv 

J VR—joO 

f^VR-\-joo -1 

Y{v) 

VR-joo 



(9.127) 



1 r 

27Tj J ^ 



1 - exp (-(5 - v)Ts) 



dv 



Laurent series is a two-sided infinite power series, e.g. mathematically 

+00 

A{x) = aix'^ 



— 00 



( 9 . 122 ) 
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with the condition SRy < vr < sr — srs setting the notations for the complex 
variables: 

5 = SR^jsi, V = VR^jvR and 

SRy’. largest among the real parts (convergence abscissa of y{t)) of the poles of 

Srs’. convergence abscissa of Sts(s), {srs = 0). 

The Laplace transform of fho periodic train of ideal impulses has 

the following poles: ps^k = s-\-j2k7r/Ts. On the other hand, we can assume that 
the poles of Y (s) have a negative real part (corresponding to a stable process) 




Figure 9.17: Integration by the residuals method with a contour including 
the poles of Ar^ (s — v) 



The complex integral is calculated by the residuals method, where the integra- 
tion contour (Fig. 9.17) is formed by a half-circle of infinite radius centred at 
(vr, 0), covered in the clockwise way. As the poles of Y (v) have a negative real 
part, this contour includes only the poles of Ar^(s — v). We obtain 



Residuals of Y (v) 



■ exp(—(s 



By setting the ratio of two polynomials 



Y(v) 



N(v) 



(9.129) 
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we obtain the residual rk relative to a given pole ps^k 



N{ps,k) 

[D{v) {1 - exp(-(s - v)Ts)}]v=p,,, 



Y{s + j2kTr/Ts) 
Ts 



(9.130) 



thus 

^ oo 

W(s) = - ^ Y{s+j2kTT/Ts) (9.131) 

7 

k=—oo 

assuming that y{t) = 0 for t < 0 and ^(0) = 0. If the signal y{t) is discontinuous 
at 0, ^(0) 7 ^ 0 and 



1 1 _ 

Y*{s) = -y{0+) + - ^ Y{s+j2k7r/T,) (9.132) 

k=—oo 




Figure 9.18: Integration by the residuals method with a contour including 
the poles of Y (v) 

Instead of taking the previous integration contour, it is possible to take as a 
contour a half-circle of inhnite radius centred at covered in the anti- 

clockwise way (Fig. 9.18). This contour includes the poles of Y{v)^ which are 
denoted by Py^i^ leaving aside the poles of (5 — 'c). The previous expression 
becomes 

Y ( 5 ) = Residuals of 

Py,i ^ 



y{v) 

1 - exp(-(5 - v)Ts 



(9.133) 
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which can be transformed by setting z = exp(5T5), resulting in 

Y{v) 



Zivit)) = Y{z) = 



Residuals of 



1 — exp('cT5) 



(9.134) 



V=Py 



This expression depends only on z and no more on s. 

When the poles of Y (v) = are simple, the residuals can be calculated 
by an expression of type (9.130), so 



ri = 



N{Py,i) 



N{Py,i) 



-^[D{v) {1 - exp(wTs) 
giving the expression of the z-transform 

Z{y{t))=Y{z) = Y,— 



[1 -exp(pj,,*Ts)z-i] 



N{Py,i) 



[1 - exp(pj,,*Ts) z-i] 



(9.135) 



(9.136) 



In the case where a pole Py^i is a multiple of order n, the residual corres- 
ponding to this pole can be calculated as 



1 



(n — 1)! 



jfi-i 

5^ 



y(») 



1 — exp('cT5) z 



-1 



(9.137) 



A contour surrounding the set of the poles of Y (v) At^ {s — v) could have 
been chosen for integration. 



The impulse response of the zero-order holder^ (Fig- 9-16) is a rectangular 
impulse of amplitude equal to 1 and of length Tg] its transfer function is the 
Laplace transform of this impulse response 



V{s) l-exp(-Tss) 



U {s) 



It results that 






(9.139) 



(9.140) 



The input u*{t) is, in reality, a sequence of impulses of amplitude u{nTs) at 
time nT, 



u*{t) = ^ u{nTs)5{t - riTs) 



(9.141) 



n=0 



'^Sometimes, the zero-order holder is defined as 



h{t) = I Ts 
0 



if 0 < t < Ts 
otherwise 



(9.138) 



In this way, the area is normalized to 1, in a similar manner to the continuous Dirac function 
as defined by physicists. 
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so that the output y(t) is the response to this train of impulses. The response 
yn{t) corresponding to the input u{nTs) is such that 

yn{t) = u{nTs)g{t - nTs) (9.142) 

from the previous dehnition of the impulse response. The complete signal y(t) 
is the superposition of the responses to all inputs u{nTs) so that 

oo oo 

y{t) = Y. Unit) — ^ ^ u{nTs)g{^ (9.143) 

n=0 n=0 

The z-transform of the output y(t) is equal to 

oo 

Y{z) = F*(s) = ^ y{nTs)z-^ (9.144) 

n=0 

By using Eq. (9.143) and the previous relation, the input-output relation results 

oo oo 

u{kTs)g{nTs - kTs)z~^ (9.145) 

n=0 k=0 

The z-transform of the input u{t) is equal to 

oo 

U{z) = YHnTs)z~^ (9.146) 

n=0 

The relation (9.145) can be simplihed: set i = n — k. Then, it can be separated 
into a product of two sequences 

oo oo 

Y{z)= ^ 5 (iT,)z-'^«(fcT,)z-'“ (9.147) 

i=—k k=0 



Taking into account the causality: the impulse response is zero for negative 
times: g{iTs) = 0 when i < 0, we obtain 

W = ET=o9{iTs)z-^EZo<mz-'^ .0 140 ^ 

= G{z)U{z) ^ > 

where G{z) is the z-transform of the impulse response g{t) of the system con- 
sisting of the set (zero-order holder + process), and is called a discrete or 
sampled transfer function or z-transfer function (also called impulse transfer 
function). 



Existence Domain of z-transforms 

The two-side z-transform is dehned by 

oo 

Y{z)= Y y(nTs)z-^ (9.149) 

n=— OO 
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and thus appears as an extension of the one-side z-transform. This transform 
is useful for signals that increase exponentially, on one side only. To ensure the 
convergence of a power series as 

oo 

'^Ui (9.150) 

^=0 

it is necessary and sufficient, according to Cauchy’s criterion, that 

lim < 1 (9.151) 

i^oo 



This condition is applied to both series constituting the two-side z-transform 

— 1 oo 

Y(z)= y(nT,)z-” + ^y(nT,)z-” =y_(z)+y+(z) (9.152) 

n=— OO n=0 

which gives for l + (^) 

lim \y{nTs)z~^\^^^ < 1 (9.153) 

Supposing that 

lim |y(nTs)y” = (9.154) 

n^oo 

the series l + (^) converges when 



1^1 > Py+ 

Changing n into — n, similarly the series Y- (z) converges when 

|z| < Py- 



assuming that 



(9.155) 

(9.156) 



lim |y(-nTs)| (9.157) 

n^oo 

The convergence domain is thus a ring between the circles of radii py^ and py- 
(Fig. 9.19). 



z-transform of a Step Function 

Consider a step y(t) of amplitude A and infinite duration. The corresponding 
sampled function is a periodic train of ideal impulses: ^*(t) = A5tX^)' The 
z-transform of y(t) is equal to 

Y{z) = Z[y{t)] = A + + . . . (9.158) 

For this series to be convergent, it is necessary that 

|z| > 1 (9.159) 

which corresponds to 5 > 0. It results that 

A Az 



Z[step] 



1 - z-i 



z- 1 



(9.160) 
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Figure 9.19: Convergence region for the two-side z-transform 



z-transform of an Exponential Function 

Consider the function y(t) = Aexp(— at) with a > 0; the z-transform of this 
function is equal to 

oo 

Y(z) = Aexp{—anTs)z~^ (9.161) 

n=0 

For the series to converge, it is necessary that | exp(aT 5 )z| > 1, which corres- 
ponds to 5 > —a. In this case, the series converges towards 



Y{z) 



A 

1 — e-xp{—aTs)z~^ 



Az 

z — exp(— aTg) 



(9.162) 



By proceeding in this manner, the invariance is ensured in the time domain, 
i.e. the continuous function f{t) and the discrete function f{nTs) perfectly 
coincide at the sampling times. Thus, it is possible to derive Table 9.4, which 
gives the z-transforms of some classical functions in parallel with their Laplace 
transforms. 



Properties of the z-transform 

Linearity: 

By using the dehnition of the z-transform of a function, it directly results that 
the ^-transformation is a linear transformation 

Z[aifi{t) +«2/2(^)] = aiFi{z) + a2F2{z) (9.163) 

Translation by a real (time- delayed system): 

When a process presents a time delay equal to an integer number of sampling 
periods: n^Tg, the impulse response of this system, which can be symbolized by 
the three blocks: zero-order holder, process, time delay, is equal to /(t — n^Tg), 
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Table 9.4: z-transforms of some classical functions 



Time 

Function 


Laplace 

Transform 


z-transform 


Dirac: S{t) 


1 


1 


Delayed Dirac: 






5(t- riTs) 


exp(— nTgs) 


^-n 


Unit step: 1 


1 


Z 




z-l 


5 




1 


TgZ 


Ramp: t 




(Z-1)2 


Exponential: 


1 




exp(— at) 


Z 




z — exp(— aTg) 


5 T a 


Second-order 


1 


z(exp(-aTs) - exp(-6Ts)) 


(5 + a) (5 + 5) 


{b - a){z - exp{-aTs)){z - exp(-5Ts)) 


t exp(— at) 


1 


TsZexp{-aTs) 


(s + a)2 


{z - exp{-aTs)r 


Cosinus: cos (cat) 


s 


z{z — cos{ujTs)) 




z‘^ — 2z cos{ioTs) + 1 


5^ + ca^ 


Sinus: cos(cat) 


u 


z sin(caT5)) 




z‘^ — 2z cos{ioTs) + 1 


5 ^ + ca^ 



with respect to the system with no time delay: zero-order holder, process, 
having as an impulse response f{t) and a transfer function F{s). The Laplace 
transform of the delayed system would be F{s) exp(— n^Tgs). The z-transform 
of the delayed system is equal to 

Z[f{t - UrTs)] = z-^^F{z) (9.164) 

Translation by a real (system with advanee): 

Denote by Ua the integer number of sampling periods corresponding to the 
advance, noticing that 



Z[f{t + riaTs)] = /[(* + na)Ts]z~^ (9.165) 

^=0 



n, 



F{z)-z^“f{0Ts)-z^“-\f{lTs) 

\f{^) - Er=o"' f{iTs)zT 



zf{{Ua - l)Ts) 



we obtain 
Z[f{t + UaTs 



= Z' 



(9.166) 
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Translation by a complex number: 

Consider the function f(t) that contains an exponential term as a factor (this 
is qualihed as a translation by a complex number by analogy to the similar case 
of the Laplace transform). The z-transform is equal to 

Z[exp{-aTs)f{t)] = F{z exp(aTs)) (9.167) 

as 

Z[exp{-aTs)f{t)] = 

= J2n=o fi'>^r)[expiaTs) z]-^ (9.168) 

= F{z exp(aTs)) 

Multiplication by t^: 

It can be shown that 

ZlCfit)] = -rz-^[Fi{z)] with: Fi{z) = ^^-'/(t)] (9.169) 

which gives, for /c = 1, 

Z[tf{t)] = -TszTf{z)] (9.170) 



Theorem of the initial value: 

It is possible to know the initial value of a function from its z-transform 

lim f{nTs) = lim F{z) (9.171) 

n^O z^oo 



Theorem of the final value: 

Consider 

Z[f{t + m - fit)] = zF{z) - z/(0) - F{z) (9.172) 

Use the series expansion 

oo n 

zF{z) - F{z) = (z - 1) V finTVz-^ = (^ - 1) lim V fikTs)z~^ (9.173) 

^ ^ n^oo ^ ^ 

n=0 k=0 



On the other hand, we have 



z[fit+m-fit)] 



T,ZMi^ + m]-f{nTs)]z- 

lim™ ELo[/[(^ + - fikr)]z-^ ^ 



Let z tend towards 1 

n 

lim V[/[(fc + 1)T,] - fikTs)]z->^ = f[{n + 1)T,] - /(O) (9.175) 

Z^l ^ ^ 

k=0 



It results that 



= lim.^i Z[f{t + m - fit)] 

= limz^i[zFiz) - z/(0) - F(z)] 
= lim2^i[(z - l)F(z)] - /(O) 



lim„^oo[/[(n + l)Tg] - /(O)] 



(9.176) 
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By simplification of the constant term /(O), we get 

lim /[(n + 1 )^ 5 ] = lim[(z — l)F{z)] (9.177) 

n^oo z^l 

which constitutes the theorem of the final value, in general given in the form 
lim f(nTs)= \im[(z — l)F(z)] (9.178) 

n^oo z^l 

Theorem of summation: 

The summation theorem can be written as 

- n 1 ^ 

2 J2f{kTs) =^—^F{z) (9.179) 

.k=0 J 

Demonstration: 

Consider the following series 

n 

g{nm = Y^f{m) (9.180) 

k=0 

This series is such that 

5 (nT,) - g[{n - 1)T,] = /(nT,) (9.181) 

which gives the z-transform 

Z[g{nTs)] - Z[g{{n - 1 )T,)] = 2 [/(nT,)] (9.182) 

G{z) - z~^G{z) = F{z) (9.183) 

hence the summation theorem. 

Theorem of Parseval: 

This theorem is essential in signal processing, where it expresses that a signal 
energy is equal to the sum of the energies of its components. In the framework 
of the z- transform, it is expressed as 

oo 17 

V y (nT,) = — f z-^F{z)F{z-^)dz (9.184) 

Demonstration: 

As 

/(nT,) = f-(f z^-^F{z)dz (9.185) 

27TJ Jc 

we can write 

= 2 ^ Ic E“=o finTs)z^] z~^dz 

= ^ k^iz)F{z-^)z-^dz 



(9.186) 
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Derivation and integration with respeet to a parameter. 







(9.187) 


nai 

Z / f{nTs^a)da 

-dao 


pai 

= / F{z^a)da 

J ao 


(9.188) 


Limit with respeet to a parameter: 






Z lim f{nTs^a) 

a^ao 


= lim F{z, a) 

a^ao 


(9.189) 


Theorem of diserete eonvolution: 


1 




F,{z)F 2 {z) =Z hikTs)f 2 [{n - k)Ts] 


(9.190) 



Ln=0 



Inversion of z-transform 

The passage from the z-transform F{z) to the continuous function f(t) is not 
unique. As a matter of fact, F{z) is the z-transform of the continuous function 
/*(t) obtained by sampling with period Tg and the zero-order holder of the 
function f(t). The information loss between two sampling instants forbids the 
reconstruction of the function f(t). The inverse transformation will be denoted 
by Z-i 

r{t) = {f{nTV} = Z-^[F{z)] (9.191) 

Several methods allow us to obtain the set {/(nTg)}, such as the residuals 
method, polynomial division, expansion as a sum of rational fractions or simply 
numerical calculation on a computer. 



a) Method of residuals. This method relies on a theorem by Cauchy 



— forfc = -l 

27TJ Jc i 0 for A; 7^ -1 

C being a contour surrounding the z-plane origin. 

On the other hand, use the expansion of F{z) 



(9.192) 



F{z) = J2finTs)z-^ 



(9.193) 



To display the term /(nTg), multiply on both sides by z^ and integrate over 
C, which gives 

/(nT,) = A / z^-^F{z)dz (9.194) 

27TJ Jc 

This integral can be calculated according to the residuals method; choose a 
contour surrounding the z-plane origin and the poles pi of F{z) 



f{nTs) = [ residuals of z'^ ^F{z) 
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b) Polynomial division. The function F{z) is often presented as a rational 
fraction with respect to z 



P{z) = = 6 q + &1 z "" ^ + . . . + hn,, 

A(z) ao -\- ai 



(9.196) 



with the condition ria > so that the transfer function is physically realizable: 
the controlled output must not precede the manipulated input. For example, 



F{z) 



-0.5 



z - 1 



is not physically realizable. 

The function F{z) is still more often expressed as a rational fraction with 
respect to this ratio of two polynomials can be expanded as a series in 
Thus 

F( \ = ^ bo + bi Z-^ + . . . + bn„ Z-^’> 

ao + «1 + . . . + (9.197) 

= /(O) + /(T,) z-i + . . . + /(nT,) + . . . 

with 

^ k — l 

f{kTs) = — [bk-'Y] ak-if{iTs)] with = 0 for A; > nt, (9.198) 

n.n ' ^ 



Example 9.2: Polynomial Division 

Given the following discrete transfer function 

2z-3 



F{z) = 



+2z - 1 



find the first values of fk = f{kTs) by polynomial division 
The polynomial division is represented by 

3^2 



2z 



-3 



-2z - 






-1 



+ - 



13 

Y 

13 



+ 3^ 

26 



-1 



-1 



13 



-2 



32 



-1 



13 

~~ 9 ^ 



-2 



T2z 



-1 






-1 



13 



It results that the first values of f{kTs) are 

2 



/o = 0 



fi = 



/2 = - 



13 



fs = 



32 



(9.199) 



-2 



32 



-3 
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c) Sum of rational fractions. In an analogous manner to that used for 
the Laplace transformation, knowing that F{z) possesses ria poles, F{z) is 
expanded as a sum of rational fractions with respect to according to 



F{z) 



Cl 



1 — Pi z ^ 






^-PriaZ 



-1 



(9.200) 



thus 



/(nT,) = Z-i 



Cl 



1 — Pi z ^ 






. 1 - ^ r 



and 



finTs) = Cip”^» + . . . + 

The scalar coefficients q are real or conjugate complex. 



(9.201) 

(9.202) 



Relation between Discrete Transmittance and Difference Equations 

A differential equation can be discretized by any finite-difference method (Euler, 
forward or backward differences, etc.). The system is then represented by the 
general equation with constant coefficients and bi 

+ CLlVn-l + • • • + an^Vn-na ~ ^O'^n + + • • • + bnf^Un-rib (9.203) 

where the initial conditions should be specified. Using the fact that 



Z{yn-^) = z-^Y{z) 



(9.204) 



we pass in a bijective, thus unique, manner from the finite-difference equation 
to the discrete transfer function 



G(z) — ^ ^ ^ + . . . + brn^z 

U{z) ao F aiz-'^ ^ ^ 



(9.205) 



In general, as the output y is not immediately infiuenced by the input, we get 
5o = 0. The concept of physical realizability is expressed by the fact that the 
output depends only on the past inputs: thus, yn cannot depend on iXn+i- 

Example 9.3: Discrete PID Controller 

The analog PID controller corresponds to the following equation between the 
deviation variables (e(t) controller input and r{t) controller output) 



r 



e(t) + 



1 C 

— / e{x)d: 

D Jo 



'x F td 



de{t) 



m = K, 



dt 



(9.206) 
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giving, for example, the following difference equation (here based on a backward 
and non-unique difference) 



'^n— 1 — -^r 



-L 'a ( Gfl Gfl l) TD / ^ \ 

“ ^n-1 H 7 ^ K ~ 2e^-l + 6^-2) 

Ti 2 Is 



which gives the discrete transfer function 

R{z) bo + biz~'- + b 2 Z~'^ 



G{z) — 



E{z) 



1 -z 



-1 



(9.207) 

(9.208) 



with bo = Kr{l + ^ + ^) > h= Kr{-1 + ^ ~ > h = Kr^. 

It would have been possible to consider the continuous transfer function 
associated with the PID 



G{s) = Kr 



1 H h tds 

TjS 



(9.209) 



By applying the z- transform according to Table 9.4, a different expression of 
the discrete transfer function results 



G{z) = Kr 



1 + 



Tl{z - 1) 



+ td- 



z - 1 



(9.210) 



Stability Analysis 




Continuous time 



Discrete Time 



Figure 9.20: Stability domains (position of the poles) 
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Consider the system described by the discrete transfer function (with Ua > 
Til) for the z-transfer function) 



G(z) = ^ ^ Ng{z) 

U{z) ao + + . . . + Dg{z) 



( 9 . 211 ) 



Submit this system to a stable input u{t) defined by its z-transform as a ratio 
of two polynomials 



U{z) = 



Nu{z) 



Du{z) 

The z-transform of the output is equal to 



with: deg(AT„) < deg(T>„) 



N2{z) 



Ni{z) 



+ 



Dg{z) Du{z) 
= Y^{z)^Yf{z) 



( 9 . 212 ) 



( 9 . 213 ) 



In the expression of the transform of the output Y (z), in the transform Yf(z) of 
the forced response we recognize the influence of the input u{t) through the sta- 
ble denominator Du{z), thus the forced response yf{t) is stable. There remains 
the transform Yn{z) of the natural response whose stability is conditioned by 
the denominator Dg{z) of the discrete transfer function G{z). 

The transform Yn{z) of the natural response can be expanded as a sum of 
rational fractions 

ria 

( 9 - 214 ) 

1=1 

where Zi are the roots of the denominator of G{z)^ thus the poles of G{z). 
The poles of G{z) are the z values for which the discrete transfer function 
G{z) tends towards inhnity, the zeros of G{z) being the z values for which the 
discrete transfer function G{z) tends towards 0. Consider the case where only 
one pole exists, so that 



T„(z) = -Y~ = Y y(^T)z-^ (9.215) 

^ n=0 

The rational fraction can be expanded as a series, which converges only if 
|zi/z| < 1 

Yn{z) = = z ^ = z ^ci[l + ( — ) + ( — )^ + . . .] (9.216) 

Z Z\ z z 

z 

hence 

VnijiTs) = ci{ziY~^ (9.217) 
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Thus, the condition of stability is that the z poles of the discrete transfer 
function are situated inside the circle of unit radius in the complex plane (Figs. 
9.20 and 9.21). 




Stable pole 

with positive real part 



Stable pole 

with negative real part 



Unstable pole 
with positive real part 



Two complex 
conjugate poles 
with positive real part 



Two complex 
conjugate poles 
with negative real part 








Figure 9.21: Influence of the position of the poles of a discrete transfer func- 
tion on a step response 
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The analogy of this condition for discrete time with the parallel condition 
for continuous time can be realized: in fact, the continuous-time stability de- 
mands that the poles are located in the left complex half-plane corresponding 
to 7Ze{s) < 0. The transformation 

= exp(T55^) (9.218) 

then gives the discrete-time condition 

< 1 (9.219) 

In practice, stability is, of course, essential, but performance and robustness 
must also be taken into account. For this reason, in the same manner as in 
continuous time, the requirements of gain and phase margins are such that all 
of the left half-plane is not favourable (avoid being too close to the imaginary 
axis); in discrete time, only one part of the unit circle is recommended. The 
influence of the position of the poles on the response to a step input appears 
clearly in Fig. 9.21. 

The condition of stability of the transfer function G{q) is stated as 

oo oo 

0(4) = E is stable \ y ^ ^ |^(/c)| oo (9.220) 

k=l k=l 

The consequence is that the associated series expansion 

oo 

G{z) = converges V |z| > 1 (9.221) 

k=l 

This function G{z) is said to be analytic on the unit circle and outside the unit 
circle: G{z) has no poles on the circle and outside the circle. 

Criterion of stability of Jury. The criterion of stability of Jury provides 
necessary and sufficient conditions for a real-coefficient polynomial to have its 
roots inside the unit circle. 

Consider the polynomial 

P{z) = gqz^ -\- a\z^ ^ + . . . + an—iz -\- (9.222) 

whose coefficient ao is positive, then compile the associated Table 9.5. 

The two hrst rows of Table 9.5 simply contain the coefficients of the poly- 
nomial taken in the opposite order. The following row of the table (coefficients 
bi) is obtained by subtracting from the hrst row the second row multiplied by 
the ratio ri, and so on until the row 2n + 1. 

The necessary and sufficient conditions for the roots of P{z) (where ao > 0 
is imposed) to be situated inside the unit circle are that the head coefficients 
ao, 5o, Co, . . ., are positive. 
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Table 9.5: Table of the stability criterion of Jury 



Row 


















1 


do 


di 


d2 




dn-2 


dn—1 


dn 




2 


dfi 


dfi—1 


dn-2 




d2 


di 


do 


dn 

ri = — 
do 


3 


bo 


bi 


b2 




bn-2 


bn—1 


0 


(1st row of di) 

— ri (2nd row of di) 


4 


bfi—i 


bn-2 


bfi-3 




bi 


bo 




bn — l 

bo 


5 


Co 


Cl 


C2 




Cn-2 


0 




(1st row of bi) 

— V 2 (2nd row of bi) 


6 


Cn-2 


Cn-3 


Cn — 4 




Co 






Cn-2 

T3 = 

Co 


2n + 1 


Wo 


0 















Example 9.4: Stability of a Second-Order Transfer Function 

A classical example concerns the stability of the sampled following second-order 
transfer function 



G{z) 



hoz‘^ + biz + 1)2 
z‘^ + aiz + 



(9.223) 



Table 9.6: Table of Jury stability criterion in the case of a discrete second- 
order transfer function 



Row 










1 


1 




dl 


d2 




2 


d2 




dl 


1 


d2 

n = T 


3 


1 -ai 




dl — did2 


0 




4 


dl — d\d2 




l-al 




dl — did2 dl 

^2=1 2=1, 

1 — 0-2 1 T d2 




1 — d\ — (dl — d\d2j 




0 






o 


1 

l + d2 







Jury Table 9.6 results yield the following two conditions 

1 — >0 |a2| < 1 

1 — — (ui — aia2) — > 0 (1 + a2)^ — af > 0 

1 + U2 



1 + tt2 > |ui I 
(9.224) 
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The stability domain with respect to both parameters ai and a 2 resulting from 
these conditions is the inside of the triangle of Fig. 9.22. 




Figure 9.22: Stability domain of the discrete second-order transfer function 
with respect to its parameters 



Causality and time delay. Consider a discrete transfer function 



G{z) — 



Y{z) _ 6 q + + . . . + bnt,Z^ 

U (z) uq + diz + . . . + 



A system is causal when the output depends only on the past inputs. This 
physical condition implies for the discrete transfer function, expressed as a 
ratio of two polynomials with respect to z (not to !), that 



Un > rih 



(9.226) 



Such a discrete transfer function is proper. 
The relative degree r is dehned by 



r = ria- rib 



(9.227) 



If the relative degree r is zero, the discrete transfer function is biproper and 
the output reacts without delay to an input variation. If the relative degree is 
strictly positive, the discrete transfer function is strictly proper and the output 
reacts with a delay of r sampling periods to an input variation. 
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9.5.2 Conversion of a Continuous Description in Discrete 
Time 

Suppose that a process is known by its continuous transfer function G{s). It is 
interesting to deduce a corresponding discrete transfer function. In fact, as the 
discretization of a continuous differential equation is not unique, the discrete 
representation depends on the mode of transformation chosen. A given corre- 
spondence between 5 and z will be used allowing us to transform a continuous 
transfer function into a discrete transfer function and the transformation will 
be qualified as a frequency one. 

Here, let us cite several possible transformations which allow us to get a 
discrete transfer function G{z)^ knowing the continuous transfer function G{s). 
Tg represents the sampling period. 

Consider a continuous system described by the state equation 



x{t) = Ax{t) + Bu{t) 



(9.228) 



subjected to a zero input. Integrate the resulting differential equation between 
two instants to and t\. Moreover, assume that t\ = to If we approximate 

the integral of Ax(t) by the area of the rectangle of length Tg and height x(to), 
we obtain the approximation 



x(to + Ts) - x{to) w Ax(to)Ts a: (to + T^) — ^ Ax(to) 

-L s 

hence 

^U\ ^ + X) - x(t) 

x[t) ~ ^ 

leading to the following formula of forward difference. 

Forward difference (integration) or implicit Euler method: 

z-1 



(9.229) 



(9.230) 



(9.231) 



The stability of G{s) is not maintained by this transformation, which is equiv- 
alent to placing a zero-order holder at the input. 

Backward difference (integration) or explicit Euler method: 



s = 



z-1 



TgZ 



(9.232) 



The stability of G{s) is maintained by this transformation. 
Bilinear or trapezoidal integration, or Tustin method: 



2 z-1 
Tg z -\- 1 




s = 



z = 



(9.233) 
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This mapping: s ^ z is bijective (to any value of s corresponds one and only 
one z, and vice versa). It offers the advantage of making the imaginary axis of 
the complex plane of s (stability limit in continuous time) correspond to the 
unit circle of the complex plane of z (stability limit in discrete time). Moreover, 
to the left half-plane of 5 (stability region in continuous time) corresponds the 
inside of the unit circle for z (stability region in discrete time). The Tustin 
method is frequently used. In particular, it allows us to perform the frequency 
analysis of a discrete-time system from a continuous-time system by the trans- 
formation of s into z = exp{jcjTs), where co is the frequency of the digital 
system associated with the frequency uJ of the analog system by the relation 

9 , j'T' 

^=^tan(Y) (9-234) 

To the domain [0,+oo[ of the analog frequency uJ corresponds the domain 
[0,+7 t/T 5[ of the digital frequency co (Fig. 9.23); the figure might have been 
extended to the domain of negative frequencies which is symmetric with respect 
to the axis origin. This compression from analog to digital frequency is called 
frequency warping. While the unit of analog frequency is rad/s, the unit of 
digital frequency is rad/sampling interval. Thus, the highest digital frequency 
is 7T rad/sampling interval and is called the Nyquist frequency. In different 
units, it is equal to 



Nyquist frequency 



7T rad/sampling interval 

^ rad/s 
Ts ' 

— Hz 
2Ts 

- revolutions/s 



(9.235) 



The normalized frequency (which is not always normalized in the same man- 
ner!) takes as reference the Nyquist frequency, where the latter can be indicated 
as being equal to tt (refer to rad/sampling interval units) or 0.5 (refer to revo- 
lutions/s units). 

In a more general formulation (Mitra and Kaiser, 1993), the Tustin trans- 
formation can be expressed as 



z- 1 

s = a z 

z^l 



1 + - 
a 

1 - - 
a 



(9.236) 



where a would take the value a = 2/Tg in the previous Tustin transformation. 
The stability properties mentioned above are maintained. 

Method of pole-zero eorrespondenee: 

Consider a continuous transfer function G{s) having poles pi and zeros Qi. Ac- 
cording to the formula z = e-xp{sTs), each pole pi is transformed into exp(p^T 5 ) 
and each zero Qi into e-xp(qiTs). Moreover, the steady- state gains of the con- 
tinuous and discrete transfer functions are equal. 
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Ts 

Figure 9.23: Correspondence between the analog frequency uJ and the digital 
frequency u for Tg = 2 



Example 9.5: Transformation by Pole-Zero Correspondence 

By using the method of pole-zero correspondence, to the following continuous 
transfer function 



(s + 2)(s + 3) 



(9.237) 



corresponds the discrete transfer function 



^ 5(z -exp(T^)) 

{z - exp{-2Ts)){z - exp(-3Ts)) 



(9.238) 



In this form, the gain of the discrete transfer function would be different from 
the steady-state gain of the continuous transfer function. It suffices to multiply 
the numerator of the discrete transfer function by a coefficient c to ensure the 
same steady-state gain. 



The drawback of the pole-zero correspondence is that if the continuous transfer 
function possesses more poles than zeros, the discrete transfer function will have 
a relative degree equal to the difference of the number of poles and zeros, so 
that it introduces a time delay equal to the relative degree. For this reason, 
often a polynomial of degree r — 1 is artificially introduced in the numerator in 
order to bring back the delay to the minimum delay of one sampling period of 
the output with respect to the input. 

In Table 9.7, some discrete transfer functions corresponding to continu- 
ous transfer functions are given in the case of backward difference (zero-order 
holder). Differences may be noted by comparison with classical references. Ta- 
ble 9.7 is in agreement with numerical results given by Matlab©, in particular 
with respect to the steady-state gain. 
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Table 9.7: Continuous-discrete transformation by assuming a zero-order 
holder at the input (backward difference of Euler type) 



Time 

Function 


Continuous 

Transfer 

Function 

G{s) 


Discrete 

Transfer 

Function 

G(z) 


Dirac: 6{t) 


1 


1 


Delayed Dirac: 6{t — nTs) 


exp(— nTss) 






1 


zTs 


Unit step: 1 






s 


z - 1 


Ramp: t 


1 


Ts z+\ 


1 


2 (z-l)2 


Exponential: exp (—at) 


c 


s + a 


z — exp(— aTs) 


Second-order: exp (—at) 


1 


c{z + 1) 


(s + a)(s + h) 


{z - exp(-aTs))(z - exp(-6Ts)) 



9.5.3 Operators 

Operator q: 

Instead of ^-transformation, the forward shift operator q or the backward shift 
operator q~^ can be defined as 

y{t + l)=qy{t) or: y{t - 1) = q~^y{t) (9.239) 

The difference equation of a linear system 

(^OVn + CLlVn-l + • • • + an^Vn-na ~ ^O'^n + + • • • + bmUn-rih (9-240) 

thus becomes 

[ao + aiq~^ + . . . + yit) = [&o + hiq~^ + . . . + u{t) (9.241) 

which will be simply denoted by 

= B{q)u{t) (9.242) 

where A{q) and B{q) are the following polynomials, in fact, depending on q~^ 
in the form 

A{q) = a^^aiq + ^0 94^^1 

B{q) = 6o + + ■■■ + &«„?■”" ^ ^ 

The ratio of both polynomials will be considered as the discrete transfer ope- 
rator of the discrete transfer function (strictly speaking, G{z) should be used 
in the latter case) of the system. For linear systems, the forward shift operator 
q and the variable z defining the z-transform are equivalent, but the operator 
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q is defined only by Eq. (9.239), so that it can be applied to any discrete-time 
system, thus as well to nonlinear systems. 

The variable z is analytical: we can speak of numerical values Zi of the poles 
of a transfer function G{z). The operator q does not possess any numerical 
value; it gives the transfer function G{q)^ whose mathematical expression is 
strictly identical to G{z). 

Operator 5 : 

The operator 5 is defined from the operator q by the relation 

5 ^ (9.244) 

where Tg is the sampling period. With the relation between q and 5 being 
linear, all operations previously realized with the operator q (or z for a linear 
system) can be performed with operator 5. In particular, this definition can 
be used to transform a discrete transfer function G{q) (or G{z) in this linear 
case) into a transfer function H{S). The stability condition z < 1 becomes 
1 + STg < 1. In state space, by means of the operator S, a linear system can be 
written in the general form analogous to Eq. (7.1) 



oXk = — = A^Xk + B^Uk 

s 

Vk =CAXk 



(9.245) 



This system is equivalent to the discrete system 

f Xk-\-i — (yl T TgA^^Xk T TgB A^k 
\ Vk = CAXk 



(9.246) 



With regard to the matrices of the continuous system, the matrices Aa and 
Ba are respectively equal to 



Aa 



exp(AT5) - 1 



Ba = 



exp(At)Bdt 



(9.247) 



Example 9.6: Discretization of a Continuous Second-Order Transfer 
Function 



Consider, for example, a continuous second-order transfer function 

Two types of transformation have been applied to this transfer function to 
obtain a discrete transfer function: sampling with a zero-order holder and a 
Tustin bilinear transformation (calculations with Matlab©). The left column 
of Table 9.8 results. In both cases, different sampling periods Tg have been 
used. Then, the operator S has been applied to each of the discrete transfer 
functions. It appears that, unlike other z-transfer functions, the denominator 
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Table 9.8: Transformations from a continuous transfer function into a discrete 
transfer function by two different transformations and resulting (5-transforms 



Sampling 


Zero-Order 


(5-Transform 


Period 


Holder 




Ts=l 


0.0494Z + 0.0442 


0.4447(5 + 0.8426 


- 1.6229Z + 0.7165 


9(52 + 3.3938(5+0.8426 


Ts = 0.1 


10-*(5.494z + 5.433) 


0.0494(5 + 0.9834 


- 1.9661Z + 0.9672 


9(52 + 3.0489(5+ 0.9834 


Ts = 0.01 


Period Tg too low 


0.0050(5 + 0.9983 


9(52 + 3.0050(5+ 0.9983 


Sampling 


Bilinear 


(5-Transform 


Period 


Transformation (Tustin) 




Ts=l 


0.0233^2 + 0.0465Z + 0.0233 


0.2093 


z2 - 1.6279Z + 0.7209 


9(52 3.3488(5 + 0.837 


Ts = 0.1 


10-^(2.731^2 5.463;2 + 2.731) 


0.2458 


z2 - 1.9661Z + 0.9672 


9(52 + 3.0483(5+ 0.9833 


Ts = 0.01 


Period Tg too low 


0.2496 


9(52 + 3.0050(5+ 0.9983 




Figure 9.24: Comparison of the step responses resulting from a continuous 
transfer function and from the discrete transfer function with a zero-order 
holder (left) or from the discrete transfer function with Tustin transformation 
(right) 



of S transfer functions is close to that of the continuous transfer function and 
gets all the closer because the sampling period is low. 

In Fig. 9.24, the step responses obtained from the continuous transfer func- 
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tion, from the discrete transfer function with a zero-order holder (response A) 
with sampling period T 5 = 1 , and from the discrete transfer function with 
Tustin transformation (response B), are compared. It appears clearly that the 
step response A remains under the continuous curve while the step response B 
is shared on both sides of the continuous curve. 



In Example 9.6, we thus empirically hnd again the fact that the operator S 
tends towards the derivation operator when the sampling period tends towards 
zero 

lim S = — (9.249) 

Ts^o dt ^ ^ 

Similarly, if the models were expressed in state-space form, the coefficients of 
the matrices of the S model and of the continuous s model would be close. 

To simulate a known discrete system as a discrete z-transfer function, De 
Larminat (1993) recommends that in order to improve the numerical robustness 
to go through the intermediary of the S model according to: 

1. Conversion of the discrete z-transfer function into a discrete (5-transfer func- 
tion according to H (6) = G{1 STg). 

2 . Passage from the discrete S transfer function to the discrete S state-space 
model and simulation of this model (Middleton and Goodwin, 1986). 

A reference textbook on the interest in S operator is by Middleton and Good- 
win (1990). In particular, the link continuous-discrete is emphasized at high 
sampling frequencies, with the improvement of the numerical properties by 
comparison with the operator q. The properties of the (5-transform as well as 
the (5-transforms for a certain number of usual functions are given. 
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Chapter 10 

Identification Principles 



The problems of identification of a linear system are discussed during three 
successive chapters: Chap. 10 presents, at a general level, non-parametric and 
parametric identification with the calculation of a predictor; Chap. 11 presents 
models and methods for parametric identification and Chap. 12 presents the 
algorithms of parametric identification. 



10.1 System Description 

The concerned systems are supposed to be linear and time-invariant. 



10.1.1 System without Disturbance 

The linear system of input u{t), of output y{t), which is time-invariant and 
causal, can be described by its impulse response g{k) such that 



y{t) ='Y^g{k)u{t -k) , t =0,1,2,... (10.1) 

k=l 



The sampling instants are denoted by 0, 1, 2 ... as if they were separated by a 
unit sampling period Tg in order to simplify the notation. 

Remark: 

Note that y(t) depends on u(t — 1), u(t — 2), . . . , but not on u(t)^ because it is 
estimated that the output is not immediately infiuenced by the input, even if 
the system presents no time delay. 

By introducing the delay operator q~^ such that 



q ^y{t) = y(t - i) 



( 10 . 2 ) 
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the output is 



y{t) = '^g{k)u{t - k) = '^g{k)q u{t) 

k=l 
oo 



.k=l 



k=l 

u{t) = G{q) u{t) 



The transfer function of the linear system is 

oo 

G{q)=J2q->^g{k) 



k=l 



SO that the output can be written in the absence of disturbance as 



y{t) = G{q) u{t) 

A linear filter G{q) is strictly stable if 

oo 

'^k\g{k)\ < oo 
k=l 



and stable if 

oo 

\9(k)\ < oo 

k=l 



(10.3) 



(10.4) 



(10.5) 



( 10 . 6 ) 



(10.7) 



10.1.2 Disturbance Representation 

A system is subjected to disturbances, so that the output can never be cal- 
culated even if the input is known. The disturbances may come from the 
measurement noise, from uncontrolled inputs. They are represented by simply 
adding a term to the output 



OO 

y{t) ='^g{k)u{t-k)+v{t) (10.8) 

k=l 

according to the block diagram in Fig. 10.1. 

The output signal y{t) of the system is composed of a deterministic part related 
to the input and a stochastic part v{t) related to the disturbances. In essence, 
the disturbance cannot be predicted and is often described by means of a se- 
quence {e{t)} of independent random variables having some given probability 
density. The disturbance model is then written as 

oo 

"(*) = E h{k) e{t — k) (10.9) 

k=0 

with the frequent hypothesis: h{0) = 1. 
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Figure 10.1: System with disturbance 



Often, the sequence {e{t)} is specihed only by its mean and its variance. 
The mean is equal to the expectation of the variable 

oo 

/j^v = E[v{t)] = E[e{t — k)] (10.10) 

k=0 

while the covariance is equal to 

oo oo 

E[v{t) v{t - t)] = '^'^h{k)h{j)E[e{t-k)e{t-T-j)] 

k=0 j=0 
oo oo 

= EE h{k) h{j)5{k - T - j)\^ (10.11) 

fe=0 j=o 

OO 

= h{k) h{k — r) 

k=0 

where is the variance of e. Of course, it is assumed that 

h{i)=0 ifi<0 (10.12) 

Due to the disturbance, y{t) is a random variable such that 

E[y{t)]=G{q)u{t)^E[v{t)] (10.13) 

In the presence of the disturbance, the output is 

y{t) = G{q) u{t) ^H{q) e{t) (10.14) 

e{t) is often chosen as white noise (its spectral density is constant in all the 
frequency domain). The name of white noise comes from the analogy with 
white light, which contains all frequencies. e{t) is a sequence of independent 
random variables of zero mean and variance A^. 

10.2 Non-Parametric Identification 

10.2.1 Frequency Identification 

As well as in continuous time, where the Bode and Nyquist diagrams allow us 
to characterize the systems and discuss their stability, it is possible, in discrete 
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time, to subject the considered system without the disturbance to a sinusoidal 
input 

u(t) = cos(cjt) = TZe [exp(jcjt)] Vt > 0 (10.15) 

The output is then 



y{t) 



oo 

'^g{k)TZe [exp{juj{t - k))] = Tie 



k=l 



IZe 



oo 

exp{ju}t) g{k) exp{-ju}k) 
k=l 



|(T(exp(jcj))| cos{iot + (/)) 



'^g{k)exp{ju{t - k)) 

.k = l 

= Tie [exp{jujt)G{exp{juj))] 

(10.16) 



where (/) is the phase shift of the output with respect to the input. 

If the signal u{t) is causal (u{t) = 0 Vt < 0), the output is equal to 



y{t) = Tie 



t 

exp{jut) g{k) exp{-juk) 
k=l 



(10.17) 



Thus, it is theoretically possible to determine the discrete transfer function by 
this frequency approach (Ljung, 1987). 

This elementary approach can be improved (Soderstrom and Stoica, 1989) 
by multiplying the output by cos(cjt) on one hand giving a signal tjc, and by 
sin(cjt) on the other hand giving a signal yg. These signals are integrated over 
a period T, which allows us to decrease the noise influence, and results in 







Y 

— 008(0) 



(10.18) 



by setting y{t) = bcos{iot + (/)). 



10.2.2 Identification by Correlation Analysis 

From Eq. (10.8), representing the system in the presence of a disturbance, it 
is possible to get the relation between the correlation functions 

oo 

Ryu{n) = E[y{t)u{t - t)] = '^g{i)Ruu{n - i) (10.19) 

i=l 

In fact, the correlation functions are calculated from the data according to the 
relations (9.72) and (9.73). 

Practically, in relation (10.19), the correlation functions Ryu^ Ruu and the 
coefficients g{i) of the impulse response are replaced by their estimations. By 
knowing the correlation functions, this amounts to solving a system of infinite 
dimension with respect to the coefficients g{i). Moreover, the sequence g(i) is 
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truncated above a given threshold M, so that the linear system to be solved 
becomes 

M 

Ryu{n) ='^g{i)Ruu{n - i) , n = l,...,M (10.20) 

i=l 

Eventually, the system can be solved according to a least-squares method for 
a number of equations larger than M. 

This method thus provides an estimation of the system impulse response 
corresponding to a hnite impulse response (FIR) hlter. 

Example 10.1: Identification by Correlation Analysis for a Chemical 
Reactor 

The chemical reactor described in Chap. 19 has been subjected to a pseudo- 
random binary sequence (Fig. 12.14). A sampling period Tg = 25 s was 
retained. The identihcation by correlation analysis was realized by means of 
Matlab©. The impulse response thus obtained is represented (Fig. 10.2) on 
the same graph as the impulse response from the ARMAX model obtained by 
parameter identihcation (na = 2,n5 = 2,nc = 2). The order of magnitude 
is more or less respected, but the quality of the identihcation by correlation 
analysis is not very good. The ‘o’ points inside the conhdence interval (dashed 
line) for the correlation analysis are neglected. 

14 
1 2 
10 
8 
6 
4 
2 
O 

-2 
-4 

O 2 4 6 8 10 12 14 

lags 

Figure 10.2: Identihcation by correlation analysis with the data coming from 
the chemical reactor simulated with a measurement noise: impulse response 
(symbol ‘o’) obtained with M = 15 compared to the impulse response obtained 
according to an ARMAX model by parameter identihcation (stairs curve) 
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10.2.3 Spectral Identification 

A linear time-invariant system constitutes a linear filter. The signal crossing 
this filter is modified in particular with respect to frequency. Consider a strictly 
stable system G{q) having as input u{t) and output y{t) 

y(t) = G{q)u(t) (10.21) 

The input u(t) is unknown for t < 0 but assumed bounded for all t 

\u{t)\ < Gu Vt (10.22) 

The discrete Fourier transforms of the input and the output are 

N N 

Un{oo) = u{n) exp(— jcjn) and Yn{uj) = y{n) exp(— jcjn) (10.23) 

n=l n=l 

According to Eq. (10.8), provided that the input and the disturbance are 
independent, the spectral and cross-spectral densities are related by the two 
following equations 



^yy{^) = |G(exp(ja;))p$„„(a;) +$„„(a;) 
= G(exp(ja;))$„„(a;) 



The transfer function equal to 



G{lo) 



Yn{uj) 

Un{lo) 



thus can be estimated in the frequency domain as 



G{lo) 



^uu 



(10.24) 



(10.25) 



(10.26) 



with the cross-spectral and spectral densities respectively and rigorously equal 
to 

^ ^ oo 

Ryu{n) exp{-jun) 

< (10.27) 

Ruu{n) exp{-jun) 

^ n=— OO 

In fact, the cross-spectral and spectral densities are, respectively, estimated as 



1 \ ^ 1 

^ Y Ryu{n)exp{-jujn) = ^^Yn{uj)Un{-uj) 

n=—N 

1 ^ ^ 1 

^ Ruu{n) exp{-jujn) = 

n=—N 



< 



(10.28) 
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This approach gives poor results (Soderstrom and Stoica, 1989) because the 
estimation of the cross-correlation is bad when n becomes large. For this reason, 
it is better to apply a window so that these terms Ryu{n) and Ruu{^) (with 
n large) are hltered and become negligible. For example, the hltered cross- 
spectral density becomes 



^ ^j,„(n)w;(n)exp(-ja;n) (10.29) 

n=-N 

This equation can also be used in the case of autocorrelations. The window 
is characterized by the vector w] among different windows, cite the rectangle 
window (Fig. 10.3) such that 



w{n) 



1 if |n| < M 
0 if |n| > M 



and the Hamming window (Fig. 10.3) such that 



(10.30) 



{ Ti 

0.54 — 0.46 cos(27r-— ) if Ini < M 

^ M-V ' ' - (10.31) 

0 if |n| > M 

Many other types of windows exist (triangular, Bartlett, Blackman, Chebyshev, 
Hanning, Kaiser, etc.). With the infinite sequence being truncated by the use 
of the window, the Gibbs phenomenon (oscillations) may occur in a more or 
less important manner according to the type of window and the choice of this 
window width. 




Figure 10.3: Hamming and rectangle windows 



Example 10.2: Influence of the Window on the Spectral Density 

The signal x{t) = cos(27r20t) + 0.5e(t) has been sampled at a frequency of 
200 Hz and considered over 1024 measurement points; e{t) is a random number 
uniformly distributed in [0,1]; thus the signal is noisy. The spectral density 
of this signal obviously makes appear the 20 Hz frequency of the signal (Fig. 
10.4). 
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Figure 10.4: Spectral density of the signal without window 




Figure 10.5: Spectral density of the signal with Hamming window of width 
M = 100 



Equation (10.29) has been applied to the autocorrelation function of the 
signal by multiplying the signal by a Hamming window for which M = 100. 
The spectral density thus estimated is shown in Fig. 10.5. The fundamental 
frequency of 20 Hz appears clearly. The influence of the choice of M can lead 
to relatively different results. 



By estimating the ratio according to Eq. (10.26) of the spectral densities in 
this manner, it is possible to estimate the transfer function of the considered 
system. 

Example 10.3: Spectral Identification for a Chemical Reactor 

The same data as in correlation analysis (Sect. 10.2.2) concerning the chemical 
reactor described in Chap. 19 are used. The sampling period is Tg = 25 s. The 
spectral identification has been performed by means of Matlab©. The Bode di- 
agram of the transfer function obtained by spectral identification is rather close 
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(Fig. 10.6) to that of the transfer function obtained by parameter identihcation 
with an ARM AX model (na = 2, n6 = 2, nc = 2), especially at low frequencies 
where the modulus is large. Note the maximum frequency: u = it/ T s ?^0,125. 




10 ^ 10 ^ 10 '' 10 ° 
Frequency 



Figure 10.6: Spectral identihcation for the data coming from the chemical 
reactor simulated with measurement noise: comparison of Bode diagrams of 
the transfer functions obtained by spectral identihcation and by an ARMAX 
model {na = 2, n6 = 2, nc = 2) (the diagram of the ARMAX model is smooth) 



10.3 Parametric Identification 

10.3.1 Prediction Principles 

In this part, general principles of prediction (Ljung, 1987) are described. In the 
following chapters, they will be largely used in parameter identihcation. The 
system is assumed to be described by the following model 

y{t) = G{q)u{t) ^H{q)e{t) (10.32) 

with polynomials G{q) and H{q) 

oo oo 

G{q) = , H{q) = '^h{i)q~^ (10.33) 

i=l i=0 

The q notation of the hlters (e.g. G{q)) facilitates the discussions about ana- 
lyticity of the corresponding function depending on z (e.g. G{z)). Qualifying 
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the two previous functions as a polynomial is certainly improper, as these are 
polynomials with respect to q~^ and only functions with respect to z can be 
considered as analytic. However, we will often keep this practical designation. 

Concerning simulation, when the previous system was subjected to an input 
Us without disturbance, the resulting output would be equal to 

ys{t) = G{q)us{t) (10.34) 

Similarly, the influence of the disturbance could be simulated by creating a 
random sequence es{t)^ which would be white noise such that the disturbance 
is 

Vs(t) = H{q)es(t) (10.35) 

Then it is possible to know the system response to both input and disturbance. 

10.3.2 One-Step Prediction 

Disturbance Prediction 

Given the model of the disturbance as 

oo 

v{t) = H{q)e{t) = — i) (10.36) 

^=0 

and the associated filter H{q) which is assumed to be stable, as 

oo 

^^|/i(i)|<oo (10.37) 

^=0 

a key-problem in identification is to be able to calculate the sequence e{t) whose 
statistical properties are known (this is white noise). Set 

oo 

e{t) = Hinv{q)v{t) = hinv{i)v{t - i) (10.38) 

^=0 

with the analogous stability condition 

oo 

|/imt;(*)| < oo (10.39) 

i=0 

Let the function H{z) be equal to 

oo 

H{z) =^h{i)z~^ (10.40) 

^=0 

and assume its inverse IjH^z) to be analytical in the domain 1^1 > 1, the 
function of which is defined by 

^ oo 



(10.41) 
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which means that the hlter Hinviq) is stable. The coefficients hinv{'^) are those 
searched for. Then the linear system or following hlter is dehned 

oo 

='^hinv{i)q~' (10.42) 

^=0 

Ljung (1987) shows that Hiny{q) = satishes Eq. (10.38). The coeffi- 

cients of the hlter Hinv can be calculated from Eq. (10.41). The fact that H{q) 
is an inversely stable hlter implies that the function 1 /H (z) has no poles on or 
outside the unit circle, in the same way as H (z) has no zeros on or outside the 
unit circle. 

Remark: the calculation of the hlter coefficients necessitates us to resort to the 
analytical functions with respect to z. 

Example 10.4: Inverse for a Moving Average Filter 

Ljung (1987) considers the following disturbance model 

v{t) = e{t) cie{t — 1) (10.43) 

corresponding to a hrst-order moving average model, which can be described 
by the following hlter H (q) 

H{q) = 1 + ciq~^ (10.44) 



The associated function is 

H{z) = 1 + ciz~^ = ^ ^ (10.45) 

This function, having a pole at z = 0 and a zero at z = — ci, is analytical 
provided that |ci| < 1 (no poles, nor zeros on or outside the unit circle). In 

this case, following the previous method, the inverse hlter can be described by 

oo oo 

= 1+cz-i = (10-46) 

^ ^=0 ^=0 

and the sequence e{t) results as 

oo 

“ *) (10.47) 

^=0 



In the case of one- step prediction, the signal v{t) is assumed to be known up 
to time t — 1, and we wish to predict the value of u(t), thus, one step ahead. 
v{t) is decomposed into two parts, one unknown, the other one known 

oo oo oo 

v(t) = h{i)e(t — i) = h{0)e(t) + h{i)e(t — i) = e(t) + h{i)e(t — i) 

^=0 i=l i=l 

(10.48) 
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with h{0) = 1 {H is monic). The sum term of the right-member is known, as 
it uses only values of e{t) included between 0 and t — 1. Note that 

oo 

h{i)e{t — i) (10.49) 

i=l 

SO that 

v{t) = e(t) ^ w(t — 1) (10.50) 

Denote by v(t\t — 1) the prediction of v(t) realized from the information from 
instant 0 until {t — 1). v{t\t — 1) is the expectation of v{t) 

v{t\t — 1) = E[v{t)] (10.51) 

hence 

v{t\t — 1) = E[e{t)] + w{t — 1) (10.52) 

It follows that the stochastic character of the prediction v{t) depends only on 
the statistical properties of e{t). Suppose that the random sequence e{t) is 
uniformly distributed, thus it is white noise. As the mean of e{t) is zero, the 
prediction of v{t) is equal to 

oo 

v{t\t — 1) = w{t — 1) = h{i)e{t — i) (10.53) 

i=l 

This prediction minimizes the variance of the prediction error (Ljung, 1987), 
thus providing the optimal predictor x(t) 

minE(n(t) — x{t) = v{t\t — 1) (10.54) 

x{t) 

In fact, the expression (10.53) of the predictor is not adapted to calculation, 
as only the signals v{i) are known until t — 1. Thus, the expression (10.53) is 
transformed in order to make the known signals appear according to a classical 
method in identification 

" oo 

v{t\t-l) = '^h{i)q~^ e{t) 

= [H{q) - 1] e{t) (10-55) 

= [H{q)-l]H-\q)v{t) 

= [1 - H-\q)] v{t) 

The coefficients of the filter will be calculated in the same way as 

those of the function = 1/H{z). It is possible to write the following 

equivalent form 

oo 

H{q) v{t\t - 1) = [H{q) - 1] v{t) = h{k)v{t - k) 

k=l 



(10.56) 
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Example 10.5: Predictor of the Disturbance 

Consider again the previous model of disturbance (moving average of order 1) 

v{t) = e{t) cie{t — 1) (10.57) 

hence 



H{q) = l+ ciq 



-1 



The prediction formula 



thus gives 
or still 

v{t\t — 1) = 



H{q)v{t\t - 1) = [H{q) - 1] v{t) 
v{t\t — 1) + Cl v{t — l\t — 2) = Cl v{t — 1) 



— civ{t - 1) = 



^(-ci)*^- 



i=0 



Cl v{t — 1) 



l^ciq 

oo 

- v{t-i-l) = - ^(-Ci)* v{t - i) 



i=0 



i=l 



(10.58) 

(10.59) 

(10.60) 



(10.61) 



Output Prediction 

The input u is known at instant t — 1 and at the previous instants (recall that 
y{t) depends on u{t — 1), u{t — 2), . . .); the output y is known until instant 
t — 1; the output y{t) is sought. With the input-output model being 

y{t) = G{q) u{t) + v{t) (10.62) 

the right-member is composed of a deterministic part G{q)u{t) and a stochastic 
part v{t). The prediction of the output results 

y{t\t — 1) = G{q) u{t) + v{t\t — 1) (10.63) 

thus by using the prediction of the disturbance 

y{t\t-l) = G{q)u{t)+[l-H~^{q)] v{t) 

(10.64) 

= G{q)u{t) + [l - H~^{q)] [y{t) - G{q) u{t)] 

hence, hnally, 

y{t\t - 1) = H~^{q) G{q) u{t) + [l - H~^{q)] y{t) (10.65) 

or in a form making use only of H (q) 



H{q) y{t\t - 1) = G{q) u{t) + [H{q) - 1] y{t) 



( 10 . 66 ) 
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u{t) 



y(t) 



H-\q)G{q) 



l-H-\q) 



+ 



+ 



y{t\t - 1) 



Figure 10.7: Filter of the predictor 



The prediction of the output will be used in regulation; it acts as a linear filter 
having as inputs u{t) and y{t) and as an output the prediction y{t\t — 1) (Fig. 
10.7). 

Remark: 

Assuming that G{z) has no poles on or outside the unit circle, and that H{z) 
has no zeros on or outside the unit circle (stability of 1/H{z)), and knowing 
that 

oo 

G{z) = J29{i)z-^ (10.67) 

i=l 

it is possible to write, by polynomial division, as an infinite (Laurent) expansion 

DO = £»(•)=-• 

On the other hand, recall that 

oo 

Hinv{z) = H~'^{z) = y; hinv{i) z~"‘ with: /im„(0) = 1 (10.69) 

i=0 



From the prediction Eq. (10.65), the predictor denoted by f(t) is rigorously 
written as 



/>) = H-\z) G{z) u{t) + [1 - H-\z)] y{t) 

OO OO 

= '^9h{i)u{t - i) h-inv {i) y{t-i) 

i=l i=l 



(10.70) 



This expression assumes that the signals are known in the time interval [—oo^t— 
1], while, in fact, they are only known in [0,t — 1]. The previous expression is 
then replaced by its approximation 



t t 

y{t\t - 1) w y; gh{i) u{t ~i) hinv{i) y{t - i) 

i=l i=l 



(10.71) 



which amounts to replacing all the values corresponding to times larger than 
t — 1 by 0. As the coefficients gh{i) and hinv{i) decrease quasi-exponentially, 
this approximation gives satisfactory results. 
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Prediction Error 

The a priori prediction error is the difference between the output at time t 
and the prediction of this output made at time t from the previously known 
outputs, as 

y{t) - y{t\t - 1) = -H~^{q) G{q) u{t) + H~^{q) y{t) (10.72) 

We notice that this prediction error corresponds to the non-predictible part of 
the output 

y(t) - y(t\t - 1) = e(t) (10.73) 

The a priori prediction error is also called the innovation at time t. 



10.3.3 p-Step Predictions 

Disturbance Prediction 

The problem is then to predict the disturbance v{t-\-p), knowing v{i) for i <t. 
The disturbance is related to white noise by the relation 



j(t + p) = ^ h(i) e(t + p - i) 



(10.74) 



^=0 



This sum can be decomposed into two parts, the second being known at time 
t, the first unknown but of zero mean 



p-i 



v{t +p) = E h{i) e(t + p — i) + h{i) e(t + p — i) 

^=0 i=p 

= Hi{q)e{t +p) + Hc{q)e{t) 
by defining the partial sums 



p-i 



Hi{q) = h(i) q * and: Hc{q) = h{i) q 

i=0 i=p 

The expectation v{t p) is then equal to the known part 

oo 

i){t + p\t) = h{i) e{t + p -i) = Hc{q) e{t) 



,p—l 



l=P 



(10.75) 



(10.76) 



(10.77) 



thus giving the expression of the p-step prediction by using the relation e{t) = 
H-\q)v{t) 



v{t + p\t) = Hc{q) H ^{q)v{t) 



(10.78) 
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Output Prediction 

The p-step output y(t + p) is given by the relation 

y{t + p) = G{q)u{t + p) + v{t + p) (10.79) 

With the inputs u being known from — oo tot + p — 1 and the outputs y from 
— oo to t, we obtain 

y(t + p\t) = G{q) u(t + p) + v(t + p\t) 

= G{q) u{t +p)+ Hc{q) H~^{q) v{t) 

= G{q) u{t +p)+ Hcisi) H-^{q) [y{t) - G{q) u{t)\ 

= HM H-\q) y{t) + G{q) [l - q~P H,{q) H~\q)] u{t + p) 

(10.80) 



Prediction Error 



The a priori p-step prediction error for the output is equal to 



y{t +p)~ y{t+p\t) 



[1-q PHc{q)H ^{q)][y{t + p) - G{q) u{t + p)] 
[1-q-PHM H-\q)]v{t+p) 

[1-q-PHM H-\q)]H{q)e{t+p) 
[H{q)-q-PH,{q)]e{t+p) 

Hi{q)e{t+p) 



and thus depends on the successive noises e{t + 1), e{t + p). 



(10.81) 
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Chapter 11 

Models and Methods for 
Parametric Identification 



The estimation of the parameters of a given model is called parametric identih- 
cation. For the notions of identihability (the possibility of obtaining the param- 
eters, for a certain number of experimental data) and of distinguishability (the 
possibility of distinguishing between distinct structures of models), the reader 
may refer to the following texts: Walter (1987); Walter and Pronzato (1997). 
The methods proposed in this chapter essentially concern time-invariant linear 
systems. Parametric identihcation is presented in the cases of transfer function 
and state-space systems. 

11.1 Model Structure for Parametric 
Identification 

It is necessary to choose a type of model before proceeding to the system 
identihcation. After the choice of the model structure, it will be possible to 
estimate the parameters of this model. 

11.1.1 Linear Models of Transfer Functions 

The output of the single-input single-output time-invariant linear system, sub- 
jected to a disturbance (Fig. 11.1), is modelled as 

y{t) = G{q) u{t) + v{t) = G{q) u{t) +H{q) e{t) (11.1) 

This model is completely characterized by the coefficients of the impulse 
response g{k), the spectral density of the disturbance ^^;(cj) = A^|i7(exp(icj))p, 
and the probability density function of the noise e{t). The transfer functions 
are 

oo oo 

G{q) = '^g{i)q~\ H{q) = 1 + '^h{i) q~^ 

i=l i=l 



( 11 . 2 ) 
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u(t) 



G 





e{t) 


H 


+ 

+ 


v{t) 



y{t) 



Figure 11.1: Single-input single-output linear system subjected to a distur- 
bance 



We notice that H{q) is monic (/i(0) = 1). In general, the number of coefficients 
in the expansion is taken to be finite. Moreover, the probability density function 
is not given, but, for example, it is Gaussian or its two first moments, the mean 
and the variance, are known 

E[e{t)]=Q, E[{e{t)-0f]=X'^ (11.3) 

The coefficients to be determined, in general, are not known through the know- 
ledge of the physical model (although this can be very helpful in some cases), 
but by a black-box representation, thus demand estimation techniques and en- 
ter into the model as parameters to be determined. The parameter vector 
is denoted by 6 and depends on the used identification method. The system 
model could be written in these conditions as 

y{t) = G{q, e) u{t) +H{q, 0) e{t) (11.4) 

and, in fact, covers a family of models dependent on 0. This general model is 
probabilistic, as the probability density function is specified through e{t). 

It is advisable to compare this model to the prediction model where the 
output depends only on the past inputs and outputs (cf. Eq. (10.65)) 

y{t\t - 1) = H~^{q, 0) G{q, 0) u{t) + [l - H~^{q, 6^)] y{t) (11.5) 

Equation Error Models 

Autoregressive exogenous ARX model: 

This model, which is the simplest, is written in the form of the following dif- 
ference equation 

y{t) + ai ^(t - 1) + . . . + y{t - Ua) = bi u{t - 1) ^ ^ bn^ u{t - nt) + e{t) 

(11.6) 

or still 

e{t) =y{t) +aiy{t-l)+ ... + an,, y{t - Ua) - biu{t - 1) - ... - bn, u{t - Ub) 
=A{q) y{t) - B{q) u{t) 



(11.7) 
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by setting the polynomials 

A{q) = 1 +ai q~^ + . . . q~"^" and: B{q) = bi q~^ + . . .+6^^ q~"^^ (11.8) 

In the expression of B{q), we integrated the fact that the output is considered 
to be always delayed with respect to the input by at least a sampling period. 

The error term e{t) concerns the totality of the equation and appears as a 
moving average; for this reason, this type of model is also called an equation 
error model (Fig. 11.2). 




Figure 11.2: Principle of equation error 
The parameter vector to be determined is 

0 = [ai , . . . , , bi , . . . , (H-9) 

The model is thus of the form 

Mq) y(t) = B{q) u{t) + e{t) (11.10) 

thus 

^ ^ G(^,6')M(t) +H{q,0)e{t) (H-H) 

giving the transfer functions G and H (Fig. 11.3) 

Gto,0) = fK. «(«.«) = 

The model and the disturbance thus possess the same dynamics, specihed by 
the denominator A{q). 

The model is called ARX, as the part A{q) y(t) is the regressive part in the 
expression of y(t) and B{q) u(t) is the exogenous part (external input). 

The predictor associated with this model can be easily deduced from Eq. 

(11.5) 

y{t\e) = [1 - A{q)] y{t) + B{q) u{t) (11.13) 

The observation vector is 



4>{t) = [-y{t - 1) , ■ ■ ■ , -y{t - Ua) , u{t - 1) , . . . , u{t - nb)f (11-14) 
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Figure 11.3: Structure of ARX equation error model: autoregressive exoge- 
nous 



The predictor of the output is thus the scalar product of the parameter vector 
by the observation vector 

y(t\e) = 4^ it) e = 4>{t) ( 11 . 15 ) 



With the predictor being a linear function with respect to the parameters, 
the problem is a linear regression problem and the parameters can be searched 
by least-squares procedures. 

ARMAX model: autoregressive moving average exogenous: 

In order to better describe the disturbance term, rather than the ARX model, 
the ARMAX model is often preferred where the transfer function related to the 
disturbance will be the ratio of two polynomials according to the relation 



yit) 






m 



m 



G{q, 9) u{t) + H{q, 9) e{t) 



corresponding to the transfer functions G and H (Fig. 11.4) 



G{q,9) 



B{q) 
A{q) ’ 



H{q,9) 



cjq) 

A{q) 



(11.16) 



(11.17) 



In the ARMAX model, as in the ARX model, the model and the disturbance 
possess the same dynamics, specified by the denominator A{q). 

The model expressed as a difference equation is thus 



y{t) + ai - 1) + . . . + an^ y{t - Ua) = - 1) + . . . + bn^ u{t - n^) 

+ e{t) + Cl e(t - 1) + ... + Cn, e{t - Uc) 

(11.18) 

with the polynomial 

C(^) = 1 +Ci^“^ + ... (11.19) 

With respect to the ARX model, the parameter vector is increased by the 
coefficients ci, . . . , c^^ of C{q). 

To obtain the predictor of the output for this ARMAX model, it suffices to 
replace e{t) in the previous model by [y{t) — y{t\0)], hence 






Ajqy 

C{q). 



y{t) + 



B{q) 

C{q) 



u{t) 



( 11 . 20 ) 
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Figure 11.4: Structure of the equation error model ARMAX: autoregressive 
moving average exogenous 

It is also possible to use the general predictor of Eq. (11.5). The previous 
expression, not being linear with respect to the sought parameters, is linearized 
by multiplying by C{q) so as to obtain an equation of type (11.15) 

C{q) yit\0) = [C{q) - A{q)] y{t) + B{q) u{t) (H-21) 

which amounts to hltering the input and the output by C{q). In fact, the 
predictor is presented in the linear form deduced from the previous expression 

y{t\6) = y{t\6) + [C{q) - A{q)] y{t) + B{q) u{t) - C{q) y{t\6) (11.22) 

which is reordered according to the hnal linear form of the predictor 

y{t\9) = [1 - A{q)] y{t) + B{q) u{t) + [C{q) - 1] \y{t) - y{t\d)] 

= [1 - Mq)] yit) + B{q) u{t) + [C{q) - 1] e{t, 0) 

SO as to make the prediction error intervene (Fig. 11.5). The error is called a 
priori if we take 6>(t — 1), a posteriori if we take 0{t) 

= y{t) -y{t\e) (11.24) 



y{t) 



Process 



u(t) 



+ 



Prediction 



Prediction 

Model 



Error 



y{t) 



Figure 11.5: Principle of prediction error 

We notice that the differences C{q) — 1 and 1 —A{q) make use of the coefficients 
Ci and ai with i < 1. The observation vector is deduced from the prediction 
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model, which depends on the parameter vector through the prediction error 
e(t, 0) 

= [-y{t -1), -y{t -na),u{t-l), ...,u{t-nb) , 

e{t - 1,6'), . . ., e{t - nc,6)f 

and the parameter vector 

6^ = [ai , . . . , , 6i , . . . , , Cl , . . . , c„J^ (11.26) 

which allows us to write the predictor in the classical form 

y(t\0) = 4F{t,0)e = e'^ 4>{t,0) ( 11 . 27 ) 

This is then called a pseudo-linear regression (Ljung, 1987). 

The ARMAX model itself includes an autoregressive term A{q)y{t), an in- 
put term B{q)u{t), a moving average term C{q)e{t), and takes different forms 
according to the particular values of the coefficients: 

• AR (autoregressive) model if = Uc = 0. The output is expressed as a 
pure time series without any input signal 

MQ)y(t)=e(t) (11.28) 

The associated predictor is equal to 

y{t\0) = [1 - A{q)] y{t) (11.29) 

• MA (moving average) model if = 0. The output is equal to 

y(t) =C{q)e(t) (11.30) 

and does not depend on the input. The associated predictor is equal to 

ym = [C{q)-l]em (11.31) 

• ARM A (autoregressive moving average) model if = 0. The output is 
expressed as the relation 



A{q)y{t) = C{q)e{t) (11.32) 

and simply describes the inffuence of a disturbance in a general manner. 
The associated predictor is equal to 

y(t\0) = [C{q) - 1] e{t\0) + [1 - A{q)] y{t) (11.33) 

• ARIMA (autoregressive integrated moving average) model if = 0 and if 
we force the factor A{q) to contain as a factor an integrator term {1 — q~^) 
(useful in suppressing offset in control). The output is expressed according 
to the relation 

A{q)y(t) = C{q)e(t) (11.34) 

The inffuence of the disturbance, including this integrator, tends to describe 
a shift effect. The associated predictor is equal to 

y{t\0) = [C{q) - 1] e{t\0) + [1 - A{q)] y{t) 



(11.35) 
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• FIR (finite impulse response) model if Ua = ric = 0. The output is simply 
equal to 

y{t) = B{q)u{t) + e{t) (11.36) 

The associated predictor is equal to 

y(t\0) = B(q)u(t) (11.37) 

• ARX (AutoRegressive eXogenous) model if Uc = 0. The output is expressed 
according to the relation 

Mq) y(t) = B{q) u{t) + e{t) (11.38) 

The associated predictor is equal to 

y{t\9) = [1 - A{q)] y(t) + B{q) u{t) (11.39) 

• ARARX model. This is obtained from an ARX model by replacing the 
error term taken as a moving average by an autoregressive error term. The 
ARARX model is thus written as 

A{q) y{t) = B{q) u{t) + e(t) (11.40) 

The associated predictor is equal to 

y{t\0) = [1 - A{q) D{q)] y{t) + B{q)D{q) u{t) (H-41) 

• ARARMAX model. This is obtained by using an AutoRegressive Moving 
Average type (ARMA) for the equation error giving the equation 

A{q) y{t) = B{q) u{t) + e(t) (11.42) 

The ARARMAX model constitutes the more general case (Fig. 11.6) of the 
equation error models. The associated predictor is equal to 

y{t\0) = [C{q) - 1] e{t\0) + [1 - A{q) D{q)] y{t) + B{q) D{q) u{t) (11.43) 

All the models previously described include the transfer functions G and 
having the same polynomial A{q) in their transfer functions. This may appear 
as a limitation. ARARX and ARARMAX models, moreover, include the poly- 
nomial D{q) in the denominator of the transfer function relative to the error. 
For this reason, other types of models have been developed. 

Output Error Models 

Instead of having the error related to the equation as in the ARM AX model, it 
may seem more natural to introduce an error similar to a measurement error, 
related to the output y. The models, called output error models, will belong 
to this class (Fig. 11.7). 
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Figure 11.6: General equation error model 



Model 



F{q)y{t) = B{q)u{t) 



w(t) 



u{t) 



+ 



Output 

Error 



Process 



yjt) 



Figure 11.7: Principle of the output error 



Elementary output error model: 

The simplest output error model that can be developed will be in the form 

y W = u{t) + (11.44) 

where the error e{t) bears only on the output y{t), hence the name of this model 
(Fig. 11.8). In the ARX equation error model, the process transfer function 
was B{q)/A{q)] in this output error model, it is denoted by B{q)/F{q) to 
distinguish the different roles played by the polynomials A{q) and F{q) in the 
model structure. If we call the output without error 

w(t) = y(t) - e(t) (11.45) 

the difference equation associated with this model is written as 

rc(t) + /i rc(t-l)+ . . . ^frif w{t-nf) = hiu{t-l)^ . . . ^hn^,u{t-nh) (11.46) 

Notice that w(t) is an internal variable (Fig. 11.8), which is not observed and, 
in fact, depends on the parameter vector and thus will be denoted by w(t^6). 
The predictor results immediately from the model 
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e{t) 



u{t) 


B{q) 


w{t) + 




F{q) 





y{t) 



Figure 11.8: Elementary output error model 



Taking as the parameter vector 

0 = [fl , ■ ■ ■ , fnj , bi , . . . , bn„f (11.48) 

and as the “observation” vector 

6) = [—w{t — 1, 6>), . . . , —w{t — rif^ 6>), u{t — 1), . . . , u{t — n^)]^ (11.49) 
the classical regression relation can be written 

y{t\9) = (jF {t^9) 9 = 9^ (j){t^9) (11.50) 



Box- Jenkins models: 

The previous model can be improved by introducing a transfer function for 
white noise e{t) so that we obtain 



y{t) 






F{q) 



D{q) 



(11.51) 



By using the general equation of the predictor (11.5), we get the predictor 
associated with this model 



y{t\^) 



D{q)B{q) 

C{q)F{q) 



u(t) + 



Djqy 

C{q). 



y{t) 



(11.52) 



which could be transformed (refer to the general model) to lead to a pseudo- 
linear regression equation. By replacing e{t) by y{t\9) — y{t), the same expres- 
sion of the predictor would be obtained. 



General Model for Identification 



The most general model (Fig. 11.9) that can be written is 

This structure (Ljung, 1987) includes the different previous models, the equa- 
tion error as well as the output error ones. Thus, if it covers all cases, it is not 
well adapted to each particular case. However, it is interesting to develop the 
identification methodology for the most general model. If, moreover, the pro- 
cess includes a time delay of sampling periods between input u and output 
y, the model is modified by making the delay explicitly appear as 



Mq) y{t) = q 



n,. m 

F{q) 



u(t) + 



cjq) 

D{q) 



e{t) 



(11.54) 
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Figure 11.9: General model 



the polynomials being the following 



A{q) 


— 1 CLi q ^ + . . 


■ + 


B{q) 


= biq-^ + ... + b, 




C{q) 


= 1 + Cl q~^ + . . . 


+ c„, 


D{q) 


= 1 di q ^ + . . . 


+ dn, q-^“ 


F{q) 


= l+/i^-i+... 


+ /n/ 



(11.55) 



According to the general equation of the predictor (11.5), the expression of 
the predictor results 






' D{q)B{qy 

C{q)F{q)_ 



u(t) + 



C{q) 



y{t) 



(11.56) 



This expression can be written in a recursive form by multiplying both members 
by C{q) F{q). We then get a nonlinear equation with respect to the parameters 



C{q) F{q) y{t\e) = D{q) B{q) u{t) + F{q) [C{q) - D{q) A{q)\ y{t) (11.57) 



From Eq. (11.56), the prediction error results 



e{t,0) = y{t) - y{t\0) 



Djq) ■ 
C{q) . 



Mq) y{t) 



B{q) 

F{q) 



u{t) 



(11.58) 



It is necessary to use auxiliary variables so that the final result is a pseudo- 
linear regression, that is, a linear equation with respect to the parameters 
that are to be determined. To lead to a simple form, it is necessary that the 
auxiliary variables allow the decoupling of the products of polynomials. Thus 
we successively introduce (each new variable depends on the known variables 
IX, ^ or on the previous variables) 

w{t^0) = 

v(t, e) = 

giving the prediction error 



F{q) (11.59) 

Mq) y(t) - w(t, e) 



e{t,0) 



Djq) 

C{q) 



v{t, 0) 



(11.60) 
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The set of Eqs. (11.58), (11.60), (11.64), allows us to write the predictor 

y{t\0) =y{t)-e{t,0) 

= [A{q)y{t) - A'^ ■ Y] - [C{q)e{t, 6) - C'^ ■ E] 

= v{t, 9) + wit, 9)-A''^-Y- Diq)v{t, 9) + ■ E 

' = [vit, 9) - Diq)v{t, 6 ^)] + [Eiq)wit, 9) - ■W]-A''^-Y + C''^-E 

= -D'^ ■ V + B(q)u(t) - E'^ -W - A'^ -Y + C'^ ■ E 
= -D'^ - V + B''^ - U - E''^ -W - A''^ -Y + C^ ■ E 

(11.61) 

leading to the usual linear form of regression 

y{t\9) = 9’^ (pit,9) (11.62) 

To obtain expression (11.61), the vectors (/) and 6 were symbolized by making 
use of internal sub- vectors 

<j)=[-Y ,U , -W , E , -Vf , 9 =[A' , B' , E' , C , D'f (11.63) 



where T, [/, E, V, W are the vectors truncated from the values y(t), e(t), v(t), 
w(t), and A', F', (C', D' are the vectors symbolizing the monic polynomials 
whose coefficient 1 of order 0 was taken off. B' is the vector symbolizing the 
polynomial B. Therefore, we can write the relations 



Aiq)y{t) = ■ y + y{t) 

Biq)uit) =B''^ -U 

< Ciq)eit, 9) =C^ -E + e{t, 9) 

Diq)vit, 9) =D''^ -V + v{t, 9) 

Eiq)w{t,9) = E''^ -W + w{t,9) 



(11.64) 



where • symbolizes the scalar product of the vectors. 
The observation vector is then 



= \-yit-l), -yit-na),uit-l), ...,uit-nh), 

—w{t — 1 , 6 >), . . . , —w{t — rif^ 6 >), e{t — 1 , 6 >), . . . , e{t 
-v{t -1,6'),..., -v{t - Ud, 9)f 



and the associated parameter vector is equal to 



ric,9), 



(11.65) 



9 = [ai , . , 



5 fl 5 ■ ■ ■ 5 f^f ’ 5 ■ ■ ■ 5 5 



d 



T 

'Tldj 

(11.66) 



11.1.2 Models for Estimation in State Space 

The advantage of the state-space representation coming from the fundamental 
equations is that it is closer to the physical process than the transfer function 
models. Thus, the parameter vector can be related to the physical parameters 
that govern the system. This may allow us to better control the variation 
domains of the parameters. 
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Discrete Kalman Filter 



The process is represented in state space and discrete time for a multi-input 
multi-output system, taking into account the measurement noise Vk and the 
process noise Wk (concerning the states), by the following model 



Xk-\-i = AkXk -\- Bk Uk + Gk Wk 

Vk =CkXk~^Vk 



(11.67) 



This model, in general, comes from the discretization of the continuous-time 
state-space model. The parameters of the model are the coefficients of matrices 
Ak, Bk^ Ck- The considered instants, denoted by subscript /c, thus correspond 
to a sampling of period Tg. The noises Vk and Wk are supposed to be sequences 
of independent random variables of zero mean and of covariances 



E [wk wf] 


= Qk^kl 




E [vk vf] 


= RkSki 


( 11 . 68 ) 


E [wk vf] 


= SkSki 





where Ski is the Kronecker symbol (= 0 if /c 7 ^ /, = 1 if /c = /). The matrices 
Qk and Rk are symmetrical positive definite. Frequently, the measurement 
and process noises are assumed to be uncorrelated, so that in this case Sk = 0. 
The case where these noises are correlated will be explained later. 

The initial state Xq of the system must be specified, and is such that 



E [(*o) (*o)^] = Po 



(11.69) 



with X = X — E{x). The matrix Pq is positive definite. 

The noises v{t) and w{t) are any when the filter minimizes the a priori 
variance of the estimation error; they are assumed to be Gaussian when the 
filter maximizes the a posteriori probability of the variables to be estimated 
(Borne et ah, 1990; Radix, 1970). 

According to Eqs. (11.67,) which define a recurrence relation, it is possible 
to calculate the state and the output at instant k with respect to the initial 
conditions and the sequence of the inputs and the noises 



k-l 



^k — ^k,0^0 T ^ ^ ^k,i-\-l[BiUi T GiWi] 

i=0 

[ Vk= CkXk ^Vk 
where ^k,i is the state transition matrix defined by 

^k,i — Ak—iAk—2 • • • Ai if: k ^ i ] ^k^k — I 



(11.70) 



(11.71) 



The Kalman filter (Kalman, 1960; Kalman and Bucy, 1961), allows us to 
calculate the prediction of y{t) according to the equations 



Xk+i = AkXk ^ Bk Uk + Kk [Vk - GkXk] 

Vk = GkXk 



(11.72) 
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where Kk is the Kalman gain matrix and is the real measurement performed 
at instant k. This measurement is often denoted hy Zk to distinguish the set 
{y} of all the outputs from the set {z} of the measured outputs. 

In fact, the use of the Kalman hlter is done iteratively in two stages 
• Prediction stage: 



^k-\-l\k — -^k ^k\k T ^k '^k 

Pk-\-l\k = ^k Pk\k ^"k GkQkG^^ 

Correction stage (improvement of the estimation) 



(11.73) 



^/c+l|/c+l — ^k-\-l\k T -^/c+1 (?//c+l ^/c+l^/c+l|/c) 

Pk^l\k^l = {I — Kk^lCk^l)Pk^l\k{I — ^/c+lC'/c+i)^ + Kk^lRk^lKj^^i 

(11.74) 

Demonstrations: 

1) By dehnition, the covariance matrix of the one-step predicted error is 



Pk-\-l\k — E[(iC/j,_|_l ] 

In the prediction stage, it results that 



(11.75) 



Pk-\-l\k — (iC/j, ^k\k) T ^/c|/c) T ] 



= AkE[{xk - Xk\k){xk - Xk\kV]Al + GkE[wkwl]Gl 
= AkPk\kAj + GkQkGj 

which is indeed formula (11.73). 

2) By dehnition, the covariance matrix of the hltered error is 



As 



Pk\k — ~^[{^k ^k\k){^k ^k\k) ] 

^k\k ~ ^k\k — l T R k {y k — l) 

— ^k\k — l T R k {G T V]^ C — ^ 



(11.76) 

(11.77) 

(11.78) 



it results that 



^k\k ^k — {I -^/c^/c) (^/c|/c — 1 ^/c) T -K'/c'U/c (11.79) 

giving the covariance matrix of the hltered error 

Pk\k = {I — KkGk)Pk\k-i{I — KkGk)^ + KkRkK^k (11.80) 

which is indeed formula (11.74). 

The gain matrix of the hlter is calculated in order to minimize the following 
criterion 

tmce{Pk\k) ( 11 . 81 ) 
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The Kalman filter is thus an optimal linear filter and the optimal gain minimizes 
the sum of the variances of the estimation errors, by definition of the trace^. 
The trace represents the confidence that we have in the previous estimation; 
the greater the confidence, the smaller the trace and the lower the gain. On 
the contrary, if we wish to take into account new measurements, the gain must 
not be too low. The gain thus represents, in fact, a compromise. 

The minimization of the criterion gives the optimal gain matrix 



9trace(Pfe|fe) 



= 0 Kk = Pfelfe-iCfeS^ 



(11.82) 



with 

j:k = CkPk\k-iCl + Rk (11.83) 

By use of the matrix inversion lemma the following expression, which is 
frequently useful, is obtained 

Sp = Ck py, ClRl^ (11.85) 

By use of this expression of the optimal gain, the covariance matrix of the 
filtered error, cf. Eq. (11.74), can be simplified as 

Pk\k = {I - KkCk)Pk\k-i ( 11 . 86 ) 

This relation has an important consequence 

trace(P/e|/c) < trace(P/e|/c-i) (11.87) 



thus the trace decreases along the iterations. 

The optimal gain can also be written again in the form 

Kk = Pk\kClRl^ ( 11 . 88 ) 

thus as the covariance of the filtered error {xk — Xk\k) becomes larger, the gain 
increases for a given error matrix Rk. 

Eq. (11.80) of the filtered error covariance matrix is called the Joseph form 
(Borne et ah, 1990), and is more robust numerically (in particular, it guarantees 
that the matrix remains symmetrical positive definite along the iterations) than 
the simplified relation (11.86). 

It is also possible to write the matrix inversion lemma 

n'll = n'll-i + ClR-k^Ck (11.89) 

The Kalman filter can be considered in two different ways (Borne et ah, 
1990; Watanabe, 1992): as a filter or estimator if we first make the prediction 

^The trace of a matrix is equal to the sum of the diagonal elements of the matrix. It is 
also equal to the sum of the eigenvalues of this matrix. 

^The matrix inversion lemma concerning matrices A, B, C, iD, gives the following equality 

{A + BCD)-^ = A~^ -A-^B{C~^ + D A~^ B)~^ D A~^ (11.84) 
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then the correction, or as a one-step ahead predictor if we hrst apply the cor- 
rection then the prediction. The hlter is currently used as an observer in the 
control laws, needing the knowledge of the states. The predictor is useful for 
studying the stability of the Kalman hlter (Watanabe, 1992). In both cases, 
the prediction and correction formulae are identical. On the other hand, the 
explanation of the stages leads to two different sets of equations: 

Kalman filter as an estimator: 

Knowing the last estimation we first apply the prediction giving 
then the correction giving from which we deduce, after simplifications 

— (-^ t^k-\-l ^/c+l) {-^k ^k\k T ^k '^k) T -^/c+1 2//c+l 5 ^0|0 — ^0 

5^/c+i = Rk-\-i + Ck-\-i[AkPk\k + GkQk 

K k+1 = [Ak Pk\k AI + Gk Qk gI ] ^kh 

P k-\-i\k-\-i = [I — Kk-\-iGk-\-i][AkPk\k A^ GkQkG^^] ; Pq\q = Po 

(11.90) 

Kalman filter as a one-step ahead predietor: 

We first apply the prediction giving Xk-\-i\k with respect to the estimation Xk\k^ 
then we replace this estimation by the correction formula with respect to the 
previous prediction Xk\k-i^ from which we obtain, after simplifications 

^k-\-l\k — Ak Xk\k — 1 T Pk '^k T Ak Kk [Vk Gk ^k\k — l] 5 ^ 0 | — 1 — ^0 

Kk = Pklk-lG^^ [Ck Pklk-lG^^ -\- Rk] ^ 

(11.91) 

with the covariance matrix of the predicted error solution of the Riccati discrete 
equation 



P k-ki\k — Ak Pk\k-i A^ + Gk Qk G"^— 

Ak Pk\k-1 G"^ [C k P k\k-l G^k + Rk] ^Gk p"k\k-l 5 ^ 0|-1 = ^0 

(11.92) 

We notice in Eq. (11.91) that the gain of the predictor filter is equal to AkKk 
which is thus the product of the state matrix Ak by the gain Kk of the estimator 
filter. 

The case where the measurement and process noises are correlated, i.e. 
*S'/c 7^ 0, can be reduced to the simplified case Sk = 0 by introducing the new 
model equivalent to the system (Watanabe, 1992) 

Xk^i = AkXk ^ Bk Uk ^GkWkA Gk Sk Rk^[yk ~ CkXk - Vk] 

< = A'j^ Xk + Bk Uk + G'j^ + Gk w'^ 

^ yk = GkXkA Vk 

(11.93) 

with the new process noise which is not correlated with the measurement 
noise Vk 

w'^ = GkWk - Gk Sk Rk^Vk 



( 11 . 94 ) 
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and 



E 



Vk 



A', =Ak-GkSkRk^Ck 
G', =GkSkR~k^ 



w 



fT 



Q'k 0 

0 Rk 



^ki 



Qk 



Qk ^k Rk ^ 



s 



T 

k 



(11.95) 



In these conditions, the Kalman filter can be solved as previously, but with this 
new model. From the equations of prediction (11.73) and correction (11.74), it 
is possible to notice that only the prediction stage is modified by the introduc- 
tion of the new model. 

Kalman filter as an estimator: 

The formulae of the estimator result very simply from the formulae (11.90) of 
the estimator without correlation of the noises, by taking into account the new 
matrices and the term G'kVk- 



— (-^ R-k-\-i G k-\-i) (y-^k Gk Sk Rk G k^ ^/c|/cT 

{I - Kk^i Ck^i) Bk Uk^ 

(-^ G k-\-i)Gk Sk Rk Vk R- k-\-i yk-\-i 5 ^o|o ^0 

Sfe+i = Rk+i + Gk+i[{Ak — Gk Sk Rk^ Gk)Pk\k {Ak - Gk Sk ^ Ck)'^ 

+Gk {Qk-SkRk^Sl)Gl]Gl+, 



Kk+i = [{Ak — Gk Sk Rk ^ Gk) Pk\k [Ak — Gk Sk Rk ^ Cfe)^ 
+Gk Qk Gl - Gk Sk Si Gl\Gl^, 



-Pfe+i|fe+i = [I — Kk+i Gk+i] [{Ak — Gk Sk Rk ^Gk) 

{Ak - Gk Sk Rk^ Gk)^ + Gk{Qk - Sk R^^ Sl)Gl \ ; Po|o = -Po 

(11.96) 



Kalman filter as a one-step ahead predietor: 

The formulae of the predictor are obtained in the same manner as the formulae 
(11.91) and (11.92) of the predictor without correlation of the noises, by replac- 
ing the old matrices Ak, Qk with the new matrices A^, and by considering 
the term G'kVk- The intermediate calculations are cumbersome; they use, in 
particular, the formulae of as in Eq. (11.85). The final predictor results 
from 



^/c+ii/c — \-^k Lk C k\ ^k\k—i T Bk Uk T Lk yk 

T -1 (11.97) 

Lk — \Ak Pk\k—i G k T Gk ^k ’ ^o| — 1 — ^0 

with the covariance matrix of the predicted error solution of the discrete Riccati 
equation 

P k-\-l\k = ^k Pk\k-1 ^ + Gk Qk G"^ — [Gk Sk + Ak Pk\k-1 ^ Gk] 

[Gk Pk\k-iG"^ A Rk] ^[Gk Sk ^ Ak Pk\k-i Gk]^ Po\-i = Po 

(11.98) 




Process Control 



403 



The prediction of the output is equal to 

Vk = Ck [ql - Ak LkCk] ^ [Bk Uk + Lk yu] (11.99) 

Many algorithms allow us improvement of the numerical implementation of 
the Kalman hlter (Borne et ah, 1990; Bozzo, 1983; Favier, 1982). 



Innovation Representation 

The innovation represents the part of y(t) which cannot be predicted from the 
old values. The error between the real output and the predicted output, or 
output error prediction, is called the innovation form and is equal to 



yk\k-i — Vk~ yk\k-i 

~ yk ^ k ^k\k — l 

— ^k (y^k ^k\k — l) T '^k 

— C — l T V}^ 



(11.100) 



According to Eq. (11.91), the state representation can be written again in 
the innovation form 



^k-\-i\k — A}^ T Bj^ T A]^ K ]^y 

yk = ^k Xk\k-1 + yk\k-l 



( 11 . 101 ) 



Noting that e{t) = yk\k-i^ model can be brought closer to the transfer 
function model (11.4) by setting 



y{t) = G{q, 9) u{t) + H {q, 9) e{t) 
G{q,e) = Ck[ql-Ak]-^ Bk 
H{q,6) = Ck[qI-Ak]-^AkKk+I 



The covariance of the innovation form is equal to 



Uk\k-i 



= E 



yk\k-iyk\ 



k-l 



— ^\[yk~ yk\k-i][yk ~ yk\k~iV'^ 

— E [C T [C T 'y/c] j* 

= Ck Pk\k-i + Rk 



(11.102) 



(11.103) 



This matrix gives information on abnormal phenomena which may occur during 
observation. In effect, if a component of the innovation form yi^k\k-i becomes 
larger than n times the standard deviation cTyy,/c deduced from the matrix 
U k\k-ii fhis may indicate a sensor failure, a model change, etc. 

Example 11.1: Parameterization of the Innovation Form 

The reasoning takes place in an example. Consider an ARMAX model for 
which ria = ni) = Tic = 2] this model is written as 

y{t) + ai y{t - 1 ) + a 2 y{t - 2 ) = hi u{t - 1 ) + 62 u{t - 2 ) 

+e(t) + Cl e{t - 1 ) + C 2 e{t - 2 ) 

(11.104) 
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To transform it into the state representation form, we set the estimator of the 
state vector 

aP" (t) = [xi{t) X2{t)] ( 11 . 105 ) 

such that its first component verifies 

xi{t) = y{t) — e{t) ( 11 . 106 ) 

For this reason 



Xi{t) 



—ai y{t — 1 ) — y{t — 2 ) + u{t — 1 ) + 62 u{t — 2 ) 
+ci e(t - 1) + C2 e(t - 2) 

—a\ xi{t — 1) — tt2 y{t — 2 ) + u{t — 1) + 62 u{t — 2 ) 
+(ci - ai) e(t - 1 ) + C2 e(t - 2 ) 



( 11 . 107 ) 



We then set 



xi{t) = —a\ xi{t — 1 ) -\- bi u{t — 1) + (ci — ai) e{t — 1) + X2{t — 1) (11.108) 

giving the second component of the state vector 

X 2 {t) = -«2 y(t - 1) + 62 u(t - 1) + C2 e(t - 1) 

= —a 2 Xi{t — l)^h 2 u{t — l) (11.109) 

+ (C2 - U2) e{t - 1) 

In summary, the system of equations from which we draw the state represen- 
tation is 

xi{t-\-l) = -aiXi{t)-\-biu{t)-\-{ci—ai)e{t)-\-X2{t) 

X2{t^l) = -a2Xi{t) ^ b2u{t) ^ {C2 - a2) e{t) (11.110) 

Xi{t) = y{t)-e{t) 

giving the matrices (under the canonical observer form) 



and the parameter vector 



Cl — ai 

C2 — CI2 



, C{0) = [ 10 ] 
( 11 . 111 ) 



0 = [ai , a 2 , bi , b 2 , ki , ^ 2 ]^ ( 11 . 112 ) 



with the equality 



ki = Ci — ai Vi 



( 11 . 113 ) 
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Implementation of Discrete Linear Kalman Filter for State Estima- 
tion 

Very often, the discrete Kalman filter is used for state estimation in the linear 
domain as a linear Kalman hlter or in the nonlinear domain as an extended 
Kalman hlter. A practical and simple implementation (Brown and Hwang, 
1997) of the linear Kalman hlter for state estimation, which is often used, is 
the following 
• Prediction stage: 



^k-\-l\k — -^k ^k\k V ^k '^k 5 ^ 0|0 — ^0 

P k-\-l\k = ^k Pk\k ^"k ^kQk^^k 5 ^0|0 = ^0 

• Optimal gain calculation: 



(11.114) 



(11.115) 



• Correction stage: 



^/c+l|/c+l — ^k-\-l\k V -^/c+1 (?//c+l ^ k-\-l^k-\-l\k) 

P/c+l|/c+l = {I — Kk^lCk^l)Pk^l\k 



(11.116) 



yk-\-i available measurements. In the case of a stationary model, 

the matrices P/c, Ck are constant. In the case of a nonlinear model, 
they are taken as the Jacobian matrices of this model at the operating point 
{xk^Uk) and thus are varying matrices. 

Example 11.2: State estimation for a pilot plant evaporator 

(Hamilton et ah, 1973) describes in detail the use of a linear Kalman hlter to 
estimate the states of a pilot plant evaporator whose model is given by (?). 



Smoothing Kalman Filter 

A rather frequent application of the Kalman hlter is the smoothing hlter. This 
problem, in particular, occurs in the case of measurements realized in a process 
with different sampling periods and delays (Lucena et ah, 1995). Having the 
measurements in an interval [0,V], we wish to estimate the state x at an 
intermediate instant i between the initial instant 0 and the hnal instant N . 
Compared to i, we thus have past information denoted by x^ (forward) and 
future information x^ (backward) (Borne et ah, 1990). The past estimations 
x^ can be calculated from Eqs. (11.90) between 0 and i. For future estimations 
x^^ we consider the system evolving in the opposite time direction for k between 
N and i, thus by setting k' = N — k 



Vk' = 



—Ak' Xk' — Bk' Uk' — Gk' Wk' 
Ck'Xk' +Vk' 



(11.117) 
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in which we must apply Eqs. (11.90) for k' varying from ^ to N — i. In the 
filter equations, it suffices to change to —Ak>^ similarly for Bk and Gk- 
The matrices Rk and Ck are unchanged. The initial conditions for k' = 0 
correspond to the final conditions for k. 

The estimation Xi is the optimal weighting of the estimations obtained in 
the forward way denoted by / and the backward way denoted by b 

Xi = [pr\pp-Y^[pf'x{ +PP~"x’’] (11.118) 

where Pf is the covariance matrix of the estimation error obtained by the 
forward formula at time i (similarly for the backward b). 

Stationary Kalman Filter 

The calculation of the state estimations is made easier in the case where the 
stationary form of the Kalman filter is considered. Moreover, the calculation 
will be faster. Then, the filter is not optimal anymore, but is only a sub-optimal 
filter. However, if the first estimations are different, the subsequent estimations 
will converge towards the same values as for the iterative filter. 

The formulae of the stationary Kalman filter are obtained by considering 
the derivative of the error covariance matrix to be zero in the continuous case: 
P{t) = 0, and the variation of this matrix to be zero in the discrete case: 
Pk^i = Pk 

In the discrete case, we must consider the discrete Riccati equation (11.92) 
which becomes 

P = APA^ + GQG^ - APC^ [G PG^ + R]-^G P^ A^ (11.119) 

and the Kalman gain will be constant and calculated according to Eq. (11.91), 
transformed into 

K = PG^ [G PG^ + R]-^ (11.120) 

11.2 Models of Time- Varying Linear Systems 

It suffices to express that the model parameters are no more constant, but are 
time-dependent, i.e. 

oo 

y{t) = gt{k) u{t — k) -\- v{t) (11.121) 

k=l 

In this case, the time- varying parameters gt{k) are, in fact, the impulse response 
at time t and thus appear as a weighting function of the input. 

In the case of the state representation, the matrices containing the param- 
eters simply depend on time 

x(t + l,e) = At{9)x(t,e) + Bt{9)u(t) + Kt{0)e(t) nii99'i 

y{t) = Gt{9)x{t,9)+e{t) 

The Kalman filter can be applied to linear systems with time-varying pa- 
rameters. 
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11.3 Linearization of Nonlinear Time- Varying 
Models 



Consider a model of a nonlinear system, expressed in the state space 



y(t) = /i[x(t)] + m[x(t), tx(t)] i;(t) 



(11.123) 



where w{t) and v{t) are white noises, respectively, of the model and the mea- 
surement, I and m are scalar functions. By assuming that the nominal regime 
corresponds to a sequence of inputs and a corresponding trajectory x*(t), 
the system can be linearized in the neighbourhood of the nominal trajectory 
by a Taylor hrst-order expansion by neglecting the higher-order terms 

x{t + l) = A{t)x{t) + B{t)u{t) +w\t) ni 194'! 

m = c{t)x{t)+v\t) 

with the deviations with respect to the trajectory 



x{t) = x{t) — x*(t) , y{t) = y{t) — h[x*(t)] , u{t) = u{t) — (11.125) 



and 

d d d 

A{t) = —f{x,u)\:c-(t),u-(t ) ; B{t) = —f{x,u)\:c-(t),u-(t ) ; C{t) = -^Kx)\x>(i) 

(11.126) 

Given the approximation around the reference, the noises of the linearized 
model can be considered as white noises. 



11.4 Principles of Parametric Estimation 

The principle of parametric estimation is to hnd a parameter vector that is 
able to make the prediction errors as low as possible. The considered models 
are linear with respect to the parameters to be identihed. 

11.4.1 Minimization of Prediction Errors 

The sequence of the prediction errors {e(t, 6>)}i<t<w can be considered as a 
vector for which it is necessary to dehne a norm Vn that is to be minimized. 
The norm depends on the parameter set 0 and on the data set . 

For example, use a linear stable hlter L{q) such that 

ef{t,0) = L{q)e{t,0) , l<t<N (11.127) 

We can then use a norm, which will be commonly in the form 
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where I is a scalar function. 

We will then deduce the estimation 0 of the parameter vector as the vector 
minimizing this norm Vn 



9{Z^) = arg |mJnV/v(6>, Z^) j (11.129) 

This general technique of minimization constitutes the PEM (prediction 
error methods) family (Ljung, 1987). 

The aim of the filter L{q) is to realize a weighting function acting on the 
frequencies, thus on the noise. 

Consider the classical case where the norm is a quadratic criterion (which 
facilitates derivation) 

1 ^ 1 

^ t=l ^ 

On the other hand, suppose that the process model, expressed as a transfer 
functions, is the following 



y{t) = G{q, 0) u{t) + H{q, 0) e{t) 

In this case, the prediction error is equal to 

e{t, 0) = H~^{q, 0) [y{t) - G{q, 0) u{t)\ 



(11.131) 



(11.132) 



According to Ljung (1987), the discrete Fourier transform (DFT) of the se- 
quence {e(t, 6^)} is equal to 



EN{27rk/N, 0) 



1 

Civ 



N 

e(t, 9) exp(— i27r/ct/A^) , /c = 0, . . . , — 1 

t=i 



(11.133) 



The chosen norm corresponds to the signal energy. Thus, it is possible to apply 
the Parseval-Plancherel relation to this signal. We thus get in the frequency 
domain 

Vm(0,Z^) = ^ F ^\EN{27Tk/N,0)\^ (11.134) 

^ k=0 ^ 

y(t) can be separated into two parts, one related to the input, the other one 
related to the noise; thus 



w{t^9) = G{q^9) u{t) ^ s{t^9) = H{q^9) e{t) (11.135) 

Their DFT are respectively equal to 



Wn{uj,9) 



(T(exp(icj), 9) Un{oo) + Rn,i{oo) , 



C 

with: \Rn^i{cj)\ < 

V N 

(11.136) 
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assuming that the transfer function G is stable and the input u(t) is bounded, 
and 



S]s[(uj^ 0) = Y]s[(uj^ 0) — (^(exp(icj), 0) Ujs[(uo) — Rjsf^i(uo) (11.137) 

As we can express the prediction error by 

0) = 0) s{t^ 0) (11.138) 

the DFT of the prediction error can be expressed as 

C 

En{cj^ 6) = i7“^(exp(icj), 6) Sn{cj) + Rn,2{^) , with: |i?Ar,e(cj)| < 

yN 

(11.139) 

assuming that the transfer function H is stable and the signal s(t) is bounded. 
The norm to be minimized results 

1 ^ 1 

Vn{6,Z^) = j^Y.2 |ff“'(exp(i27rfc/iV),0)|2 \YN{27rk/N),e)- 

k=l 

G{2TTk/N), 6) UN{2Trk/N), 6^ + Rn,s , with: \RnAlo)\ < ^ 

(11.140) 

Because of the frequency form of this expression, the identihcation methods by 
prediction error are closely related to the methods of spectral analysis (Ljung, 
1987). 

11.4.2 Linear Regression and Least Squares 

The models of linear regression provide the output prediction 

At\e) = A{t)e (11.141) 

where (/) is the observation vector and 6 is the parameter vector. 

Estimation of the Parameter Vector 

The prediction error is equal to 

A, s) = y{t) - A{t) e (11.142) 

Retaining the previous notations, the hlter L{q) is taken to be equal to 1 and 
the chosen norm to be minimized is 

VN{e,z^) = ~Y,i^ [y{t)-A{t)e]^ (11.143) 

^ t=l ^ 

The ratio 1 /2 is only introduced for derivation reasons and the ratio 1 /N could 
be omitted without modifying the results. This norm is a quadratic function 
with respect to 0. It suffices to express that the condition of minimum of the 
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norm implies that the gradient vector is zero at the extremum, thus that the 
partial derivatives with respect to 0 are zero. We deduce the estimator of the 
parameter vector according to the least-squares criterion 



qLS 



1 

~N 



N 



t=l 



^ 



r N 



_t=l 



provided that the inverse of the following matrix exists 

1 ^ 

^ t=i 



N 

t=i 

(11.144) 

(11.145) 



Recall that for the ARX model, the observation vector (f>{t) was equal to 

Ht) = [-yit - 1) , ■ ■ ■ , -y{t -na),u{t-l), , u{t - rn,)f (11.146) 

The terms of the matrix R{N) are thus 

im)h = 



N 



N 



'^y{t-i)y{t-j) 



if: i <i,j <na 



t = l 
N 



'^u(t-i + no) y{t - j) if: ria < i < ria + rib and 1 < j < Ua 



t = l 
N 



^ y{t - i) u{t - j + Ua) if: 1 <i <ria and ria < j < ria ^ ^6 



t=i 

N 



N 



ria) u{t - j + ria) if: ria <hj <ria^ 



rib 



t=i 



(11.147) 

which implies that the matrix R{N) is, in fact, composed of estimations of the 
covariances of {y{t)} and {u{t)} whose calculation allows us to deduce the best 
estimation of the parameter vector. 

This multiple linear regression is often presented in matrix form 



e = Y-^0 (11.148) 

where e is the vector of the prediction errors, Y is the output vector and ^ is 
the matrix gathering the vectors (j) for the set of the experiences. 6 remains the 
parameter vector. The chosen norm is equal to 

Vm{9,Z^) = ]^[Y -^9] (11.149) 

The condition of zero gradient gives 

-Y’^ $ + 6 ^^ ($^$) = 0 



(11.150) 
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So that the solution 6 exists, the matrix must be nonsingular. This 

matrix is semipositive dehnite by construction, thus it suffices that it is strictly 
positive. When the solution exists, it is equal to 



qLS ^ ^Ty 


(11.151) 


The drawback of this method is that it demands the inversion of the matrix 
which is often ill-conditioned, and thus poses numerical problems and 
for this reason is badly adapted to on-line use. 


Estimator Properties 




Let Oo be the “true” (exact if the model represents exactly the process) para- 
meter vector responding to the model 


y{t) = 4>^{t)0o + e{t) 


(11.152) 


thus in matrix form 

Y = ^'^00 + e 


(11.153) 


Assuming that e{t) is white noise (zero mean, variance A^), it 
verify (Soderstrom and Stoica, 1989) the following properties 
a) The estimator 0^^ of the parameter vector is non-biased 


is possible to 


E(^^^) = 6>o 


(11.154) 


thus the estimator 0^^ of the parameter vector must tend towards the value 
Oo when the number of the observations tends towards inhnity; on the other 
hand, it must be as close as possible to Oq. 

The estimator 0^^ of the parameter vector is related to Oq by the relation 


0^s = + ($^$)-i$^e(f) 


(11.155) 


b) The covariance matrix of 0^^ is equal to 




cov{0^^) = A2($^$)-i 


(11.156) 



c) A non-biased estimation of the variance of white noise is given by 

s2(e) = A2 = 2 (11.157) 

where n is the dimension of the parameter vector 6. 

From these properties, it is possible to deduce the variance of each parameter 
6i as 

s^{6i) = (j‘^{6i) = cavii s^(e) (11.158) 

where covjj is the current element of the matrix ($^$)“^, and also the conh- 
dence interval of 6i at the signihcance level a according to the Student t law 



^a/2, ^ ^ ta/2, 



( 11 . 159 ) 
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Weighted Least Squares 

In the case of weighted least squares, the criterion simply contains a weight for 
the measurements which can depend on the time 

VN{e,z^) = ( 11 . 160 ) 

^ t=l 



By setting 



we obtain 

^LS 



0{N,t) = 



r N 



_t = l 



N 

'^P{N,t)(p{t)y{t) 

t = l 



( 11 . 161 ) 



( 11 . 162 ) 



Case of a Coloured Noise 

When the noise v{t) is coloured (by opposition to white noise), the least-squares 
estimator does not converge towards the true value Oq. In this case, Ljung 
(1987) introduces a linear filter describing the noise in the form 

= (11.163) 

where e{t) is white noise. The model becomes 

Mq) yit) = B{q) u{t) + e(t) A{q) D{q) y{t) = B{q) D{q) u{t) + e{t) 

(11.164) 

to which the least squares can be applied, which allows us to estimate the 
parameters of polynomials AD and BD (method of repeated least squares). 
The best non-biased estimator (Soderstrom and Stoica, 1989) is equal to 

§ = [R-^ $($^ R-^ Y = R~^ R~^ Y (11.165) 

with 

E{vv'^) = R (11.166) 

11.4.3 McLximum Likelihood Method 

Principle of Maximum Likelihood 

The maximum likelihood is first exposed in a general manner following Kendall 
and Stuart (1979). Let ^i, . . . , be a set of exclusive propositions and H the 
available information. Let p be a subsequent proposition which is also exclusive. 
From the probability properties, it results that 



r{qi,p\H) = r{qi\H)V{p\qi,H) 



( 11 . 167 ) 
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and 



V{qi\p,H) = 



riqi\H)V{p\qi,H) 



V{p\H) 



and by summing the set of propositions 



^ V{qi\H)V{p\qi,H) 
^ V{p\H) 



(11.168) 



(11.169) 



from which the Bayes theorem is deduced 



, rr. ^ nqi\H)V{p\qi,H) ^ V{qi,p\H) 

^ ^ J 2 i[nqi\H)r{p\qi,H)] J2inqi,p\H) 



(11.170) 



which gives the probability of a proposition qi when p is known. The variables 
V{qi\H) are called prior probabilities, the variables V{qi\p^ H) posterior proba- 
bilities and V{p\qi^H) the likelihood function (or simply likelihood), which is 
often denoted hy C{p\qi^H). The posterior probability varies as the product of 
the prior probability and the likelihood 



V{q^\p, H) oc V{q^\H) C{p\q,, \H) (11.171) 

The principle of the maximum likelihood stipulates that when we are faced 
with a choice of hypotheses qi^ we choose the one which maximizes C. 

Now, place ourselves in the context of parametric identihcation (Ljung, 
1987). The observations ^(1), . . . , y{N) may not be perfectly reliable; with each 
observation of the observation vector y = {^(1), . . . , y{N)}, we can associate 
a probability density function p{y\0), and for the set y of the observations 
considered as independent the joint probability density function C{y\0). Let y^ 
be the observed value (e = experimental) of vector y ; to this vector corresponds 
an estimation 0^ of the parameter vector. The probability that y can take the 
value y^ is proportional to the likelihood function C{y^\0). In these conditions, 
given the observation vector, the estimator of the parameter vector 6 according 
to the maximum likelihood method is equal to 



e^L^yC) ^ I niax/:(y"|^) 



(11.172) 



The maximum with respect to parameter vector 6 is thus searched in order to 
make the vector y as close as possible to the hxed value y^ . 

Example 11.3: Estimation by Maximum Likelihood 

For a set of variables Xi distributed according to the normal law (unknown 
mean /io, known variances cr|), the likelihood function is equal to 



£(a;®|/i) 



N 



n 






exp 



) 



(11.173) 
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The estimator of the mean according to the maximum likelihood gives 
^ML(^e) =arg{max^ C{x^\j^)} 

1 Ax, (11-174) 

We notice that when the variances are equal, this estimator gives the same 
result as the sample mean 

1 ^ 

= (11-175) 

i=l 



We can estimate the quality of an estimator 6 with respect to its true value Oo 
(in general unknown) by the expectation of the covariance matrix 

P = E [0{y^) - ^o] [Ky") - 0^"^ (11.176) 

which we try to make as small as possible. But the Cramer-Rao inequality 
(Kendall and Stuart, 1979; Ljung, 1987) stipulates that there exists a limit 
lower than P 

E p(y«) - 0o] [Ky") - > T-\G) (11.177) 

where T{6) is the Fisher information matrix defined by 

n T 

1(0) =E ^log£(y«|0) ^log£(y«|0) (11.178) 

L -I L J \q=q^ 

In fact, the estimator 6ml converges asymptotically towards a normal distribu- 
tion of mean and of covariance matrix X~^{9o) (Kendall and Stuart, 1979). 

Estimator Determination 

Now consider an input-output probabilistic model; let be the set of input 
and output data until time t. The predictor is represented by the model 

y(t\t-l) = (11.179) 

The probability density function py{y{t) = yo\Z'^~^, 6) corresponds to the pro- 
bability that y(t) is equal to some value given and 6>, and is simply de- 
noted by Py (y{t) I ^ , 6>) . With the previous model is associated the probability 

density function pe(e(0 = of the prediction error e = y{t)—y{t\t — l). 

The model is then in the form 



y{t)=g{t,Z* \0)+e(t,0) 



(11.180) 
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In these conditions, the likelihood function is equal to 

N N 

= Y[Pe{y{t) - g{t,z^~^;0)\0) =Y[pe{e{t,0)\0) (11.181) 

t = l t = l 

The maximization of this monotonous function is equivalent to that of its lo- 
garithm; we will thus maximize 

1 1 ^ 

— logCy{y'^\e) = — ^logpe(e(^,6')|6') (11.182) 

^ t = l 

The following function, related to the logarithm of the likelihood, and also 
related to the notion of information entropy and the information matrix, is 
frequently introduced 



l{e,9,t) = - log pe{e{t,0)\0) (11.183) 

The estimator is deduced according to the method of maximum likelihood 

0^qy^) =arg|mml^«(e,0,t)| (11.184) 

This method of maximum likelihood may be intricate and may necessitate 
resorting to approximations related to the difficulty of expressing the predictor, 
often in the form of a Kalman predictor. 

In the case where the prediction errors are distributed in a Gaussian way 
(zero mean, covariance independent of t) , we obtain 

I I 

/(e, 6>, t) = const + - log A + (II. 185) 

11.4.4 Correlation of Prediction Errors with Past Data 

A good model is characterized by the fact that the prediction errors that it 
provides are independent of the past data. Let be the set of past data 

at time t — 1] the predictor y{t\0) is said to be ideal if the prediction error 
e(t^6) is independent of the past data To check it, we must take a vector 

sequence {C(0I verify that it is uncorrelated with any linear transformation 
of {e(t, 6^)}. In practice (Ljung, 1987), we choose a linear hlter L{q)^ which we 
make act on {e(t, 6^)} 

epit^O) = L[q) e(t^0) (II. 186) 

Another sequence of correlation vectors denoted by {(“(t, 6^)} or simply 

{C(L^)}, constructed from the past data and from 6>, is chosen, as well as a 
function a{e). At last, we calculate the estimation of 6 such that 

I ^ 

^ t=i 



(11.187) 
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This estimator is the best that can be found from the past data. 

Remark: 

In the case of pseudolinear regressions, the prediction models are in the form 

y{t\e) = <p’^{t,e)e (ii.iss) 

We choose (“(t, 0) = (^(t, 0) and a{e) = e, so that the estimator of the pseudo- 
linear regression is the solution of 

1 ^ 

— ^ e) [y(t) - e)e] = o (ii.isd) 

^ t=l 

This pseudolinear designation comes from the fact that if 0) depends 
effectively on 6>, the predictor y{t\0) is not linear with respect to 0. The linear 
regression according to the least squares is a particular case of the pseudolinear 
regression. 

11.4.5 Instrumental Variable Method 

The instrumental variable method (Soderstrom and Stoica, 1983) is a variant 
of the least-squares method which enables to avoid the bias of the estimated 
parameter vector when the disturbance is not white noise. Moreover, the com- 
plexity of this identification method is moderate. 

Method Principle 

Assuming that the regression model is linear as 

y{t\e) = <p'^{t)e (11.190) 

an estimator of the parameter vector according to the least squares is obtained 
by minimization of the following criterion 

r 1 ^ 

e= arg < mine ^ [y'^ (t) - 6^ 4>{t)] [y{t) - <jF{t) 6] 

{ t=i 

1 ^ 

w “ w 0] = 0 ^ (11.191) 

^ t=i 

Tl ^ -1-1 r iv 

^ t=i ] t=i 

From the viewpoint of the correlation of the prediction errors with the past 
values, this would correspond to L{q) = 1 and (“(t, 0) = If the system is, 
in fact, described by a model, including a disturbance 'c(t), such that 




yit) = +v{t) 



with: v{t) = H{q i) e{t) 



(11.192) 
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the observation vector (^(t) is then correlated with v{t) and the estimator does 
not tend anymore towards the true value Oq (the estimation is not consistent). 
From Eqs. (11.191) and (11.192), we deduce 



0 = Oo -\- 



1 

N 



N 



t=i 



^ 



(11.193) 



The condition that 6 converges towards the true value 6o is that the distur- 
bances is not correlated with the outputs 



E[<P{t)v{t)]=0 



(11.194) 



The instrumental variable method is designed so as to remedy to this possible 
bias of the estimator 6 when the least squares are used. 

Introduce a vector (“(t), whose components are called the instruments or 
instrumental variables, and are such that the instrumental variable vector (“(t) 
is not correlated with v(t). The estimator is then given by the solution of the 
following system 



TV TV 

^ C(^) [y{t) - ^] = ^ C(^, 0) e{t) = 0 (11.195) 

^ t=l ^ t=l 



Provided that the following inverse matrix exists, the estimator is then given 
by 






1 ^ 






(11.196) 



The conditions that the instrumental variable vector must fulhl are 



is nonsingular 

\ E[at,e)v{t)]=o 



(11.197) 



The instrumental variables must be sufficiently correlated with the regression 
variables so that the matrix is invertible but not correlated with the noise. 

The implementation of the instrumental variable method is described in 
Sect. 12.3.6. 



Application to an ARX Model 

Suppose that the used model is an ARX model such as 

y{t) + aiy{t -1) + . . . + an,, y{t - Ua) = biu{t - 1) + . . . + bn,, y{t - Ub) + v{t) 

(11.198) 

while the true model would be 



y{t)+{ai)o y{t-l)+. . .+(«„ Jo y{t-na) = (6Jo u{t-l)+. . .+(6nJo y{t-nb)+v{t) 

(11.199) 




418 



References 



In order to ensure the conditions necessary for the instrumental variables, we 
choose them to be equal to 

c(t) = K{q) \-x(t - 1), -x(t - 2), , -x(t - Ha), u{t - 1), . . . , it(t - rih)]^ 

( 11 . 200 ) 

where K{q) is a linear hlter and x{t) is generated from the input u{t) through 
a linear system according to 



x{t) 



N{q) 

D{q) 



u{t) 



with the polynomials of the transfer function 



N{q) = mo + mi ^ + . . . + m^^ q ^ 
D{q) = l^niq~^ ^ ...^Un^ q~^^ 



(11.201) 



(11.202) 



The instrumental variables (“(t) depend on the past inputs u(t — 1) .. . through 
a linear hlter. 

This method can be used in open loop, as the inputs do not then depend 
on the noise v(t) of the system; it will then have to be modihed in closed loop. 

Ideally, the hlter should be equal to the transfer function of the system with 
N[q) = B[q) and D[q) = A[q) or 



x(t, 9) 



B{q) 

A{q) 



u{t) 



(11.203) 



SO that the instrumental variables depend on the parameter vector 0 . 
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Chapter 12 

Parametric Estimation 
Algorithms 



In view of adaptive control, many algorithms of process model parameter esti- 
mation have been developed. 

Recall that, according to the used hypotheses (Ljung and Soderstrom, 1986), 
the aim is to make the predictions at time t, knowing the inputs and outputs up 
to t — 1 which form the observation vector denoted by while the estimated 
parameter vector is denoted by 0{t). Some authors (Dugard and Landau, 1990; 
Landau, 1988, 1990) make the prediction at t + 1 and denote by (^(t) the ob- 
servation vector up to t, while the estimated parameter vector is denoted by 
6(t + 1). The difference thus bears on the adopted notation for (j)(t) and the 
prediction instant. This will lead to subscript differences in the formulae; thus 
it is necessary to carefully look at the bases of the reference textbook. 



12.1 Linear Regression and Least Squares 

This algorithm, which is the simplest, has already been discussed in a previous 
chapter. It is essentially devoted to an on-line usage, i.e. to estimate the 
parameters of a fixed model. Here, we will note some problems concerning it. 
We assume that the prediction model is 

y{t\e) = it) e ( 12 . 1 ) 

The estimator of the parameter vector minimizing the quadratic criterion of 
the sum of the squares of the prediction errors 

1 ^ 

^ t = l 



( 12 . 2 ) 
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can be written in the form 

r-i iv 1“C ^ 

(12.3) 

^ t=l ] ^ t=l 

provided that the inverse of the matrix introduced exists. Set the matrix R 

1 ^ 

= (12.4) 

^ t=l 

and the vector f{N) 

1 ^ 

/(^) = vE^(^)y(^) (12.5) 

^ t = l 

which implies that the estimator of the parameter vector is the solution of the 
system 

R(N) = /(AT) (12.6) 

The algorithm thus defined is non-recursive. 

The matrix R{N) can be badly conditioned (the ratio of the largest singular 
value to the smallest one is very large), in particular when the inputs are not 
sufficiently exciting. In this case, it is better to seek a matrix Q such that 

QQ^ = R{N) (12.7) 

which we can obtain by the Householder or Gram-Schmidt transformation, or 
the Cholesky decomposition (Golub and Loan, 1989). 

Remark 1: 

(Ljung, 1987) shows that this decomposition of R improves the conditioning 
of the matrix. This demonstration can be done in the multivariable case (m 



inputs, p outputs) by setting 

( 12 . 8 ) 

where y{t) is the output vector of dimension p. Let 

$^ = [0(l), ...,0(iV)]^ (12.9) 

The criterion to be minimized is equal to 

N 

Jn{0) = \y-^o\^ = - A(^) of (12.10) 

t=l 



This norm is not modified by an orthonormal transformation P (a matrix P 
is orthonormal if P P^ = J), so that 

Jn{0) = |P(y-$0)|2 



( 12 . 11 ) 
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The matrix P is chosen in order to realize a factorization QR of ^ 






Q 

0 



( 12 . 12 ) 



A possibility is to take P to be equal to a product of Householder transforma- 
tions such that 



PY 



L 

M 



(12.13) 



hence 



Jn{0) 



L 

M 



Q 

0 



2 



0 



\L-Q0f + \Mf 



(12.14) 



We deduce that the criterion is minimal when 



Q0 = L 

giving the new value of the estimator. 

We can notice that 

Q= [P^f = R(N) 



(12.15) 



(12.16) 



Thus, the new condition number is equal to the square root of the condition 
number of R{N) and the new system 



Q0 = L 



is better conditioned than the old system 

R{N)0^^ = f{N) 



Remark 2: 

The vector ^ can be put in a global form 



^ = 



z{t - 1) 
z{t — n) 



(12.17) 

(12.18) 



(12.19) 



where z gathers the set of the variables necessary for knowledge of (^, e.g. for 
an ARX model z = u] and n is the largest shift necessary to form which 
is related to the structure of the model. According to Ljung (1987), it is better 
to calculate the sums (12.4) and (12.5) from t = n + 1 instead of t = 1. The 
fast algorithm of Levinson (Ljung, 1987) derives from the resulting structure 
of the matrix R. 



12.2 Gradient Methods 

Gradient methods are simple and explicit. However, in the neighbourhood of 
the minimum, Newton-type methods such as Newton-Raphson, Gauss-Newton, 
quasi-Newton methods, or even Levenberg-Marquardt (Fletcher, 1991; Gill 
et ah, 1981) are preferable with respect to their rate of convergence. 
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12.2.1 Gradient Method Based on a Priori Error 

The deterministic model of the process is written in linear form 

y{t) = (jF {t) 9 = 9^ (j){t) (12.20) 



where 9 represents the true value of the parameter vector. The associated 
predictor is written in the general form 



y{t\e) = 4>{t) 



e.g. for an ARMAX model defined by 



yit) 






A{q) 



A{q) 



( 12 . 21 ) 



( 12 . 22 ) 



the observation vector of the prediction model is equal to 

= [-y{t-l), ...,-y{t-na),u{t-l), ...,u{t-ni,) , 

e(t -1,6'), . . ., e(t - nc,0)f 

and the parameter vector 

6^^ = [«1 , ■ ■ ■ , , 6l , ■ ■ ■ , 6n„ , Cl , . . . , C„J^ (12.24) 

Two types of predictor are distinguished (Landau, 1990): 

• The a priori predictor 

y°it) = y{t\s{t - 1)) = - 1) <P{t) (12.25) 

where we use the information contained in i^(t) until instant t — 1, the 
parameters being estimated at t — 1. 

• The a posteriori predictor 

y{t) = y{t\e{t)) = e'^it) (pit) (12.26) 



where we still use the information contained in (j)(t) until instant t — 1, while 
the parameters are estimated at t. 

Two types of prediction error are associated with these two predictors: 

• The a priori error 

= y(t) - y°(t) (12.27) 

• The a posteriori error 

e(^) = y(t) - y(t) (12.28) 

In order to avoid having to completely calculate again the parameter vector 
at each time step, we frequently use the structure of a parameter adaptation 
algorithm, through which we update the parameter vector at time t from the 
previous one at time t — 1 

eit) = e{t - 1) + Aeit) = e{t - 1) + f[6{t - 1), (pit), e°it)] 



(12.29) 
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by minimizing an objective function J at each step with respect to the para- 
meter vector 6(t — 1) 

min J(t) = [e^(t)]^ (12.30) 

We will note that this criterion bears only on the instantaneous error. 

A classical minimization method is the gradient method which includes 
many variants; it consists (Fig. 12.1) of getting near to the optimum by moving 
along the gradient in the opposite direction to the gradient, i.e. along the 
normal to a contour (iso- response or iso-criterion line). Recall that gradient 
methods are less efficient than Newton-type methods near the optimum. 




Figure 12.1: Optimization according to a gradient-type method 



The gradient associated with the criterion is the vector 



9J{t) ^ o de°{t) ^ 

dOit - 1) dOit - 1) ^ 



(12.31) 



Example 12.1: Gradient Calculation for the General Model 

We wish to calculate the gradient (Ljung, 1987) in the case of the general model 
(11.53) for which the following predictor is 






C{q)F{q) 



u(t) + 



D{q)A{q) 

C{q) 



y{t) 



As the a priori error is equal to 



= yit) - y°0) = y0) - yW0 - 1 )] 



(12.32) 



(12.33) 



to calculate the components of the gradient vector dJ{t)/dO, hrst calculate the 
partial derivatives of y{t\0) with respect to the successive parameters a/^, bk, 
Cki dki fk of the vector 6 






C{q) 



y{t - k) 



(12.34) 
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C{q)F{q) 



u(t — k) 



dck 



= 



D{q)A{q) 



y{t -k)- 



C{q)C{q) 

e^(t — /c, 0) 



Djci)B{q) 
C{q) C{q) F{q) 



u{t — k) 



C{q) 






A{q) 

Ci^q) 



y{t -k) + 



B{q) 



C{q)F{q) 



u{t — k) 



C(i) 

^im = - 



d 



v(t — k, 9) 

D{q)B{q) 



C{q) F{q) F{q) 

w{t - k) 



u{t — k) 



C{q)F{q) 

The gradient vector is then calculated as 

AIK. = 2AAL = - 252 % m - ym) 



dO{t - 1) dO{t - 1) 



dO{t - 1) 



(12.35) 

(12.36) 

(12.37) 

(12.38) 

(12.39) 



If we represent the linear deterministic model in the usual form 

y{t) = e'^ (f>{t) (12.40) 



the half-gradient is simply expressed as 



1 dJ{t) 

2 dO{t - 1) 



(p{t) e°{t) 



(12.41) 



SO that the parameter adaptation algorithm will be written in a general manner 



= 9{t - 1) ^P(t){t) e%t) (12.42) 

where P is the adaptation gain matrix. The gain matrix must be symmetrical 
positive definite. 

The parametric error 0{t — 1) is the difference between the estimator 9 of 
the parameter vector and the true value 9 of this vector 



9{t-l) = 9{t-l)-9 (12.43) 

The a priori error is then equal to 

e°(^) = y(^) - y°it) = - 9’^ {t- 1) <P{t) = -Fit - 1) 4>it) (12.44) 

while the algorithm of parametric adaptation expresses the parametric error as 

9{t) = 9{t — l) — P (j){t) (jF {t)9{t — l) = \l — P (j){t) (jF {t)\ 9{t — l) = A{t)9{t — l) 

(12.45) 
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The stability of the algorithm of parametric adaptation implies that the eigen- 
values of this matrix A must be inside the unit circle. 

Example 12.2: Condition for the Gain 

Suppose that the gain matrix is a scalar positive matrix 



P = al 



then, it is necessary that 



which implies 



\\I - a {t)\\ < 1 

1 



(12.46) 

(12.47) 

(12.48) 



12.2.2 Gradient Method Based on a Posteriori Error 



Generalities 



The criterion in the simple gradient method is the square of the a priori error; 
the improved gradient method relies on the use of the square of the a posteriori 
error, giving the criterion 

min J(t) = [e(t)]^ (12.49) 



where, again, the instantaneous error is concerned. The gradient is then equal 
to 



9J{t) _ ^ de{t) 
de{t) de{t) 



(12.50) 



and as the a posteriori error is equal to 



e(^) = y{i) - y{t) = y{t) - 4 >{t) (12.51) 

the half-gradient becomes 



(12.52) 

2 dO{t) ^ ’ 

and the parametric adaptation algorithm becomes 

6{t) = 6{t-l)^P (j){t) e{t) (12.53) 

Again, the gain matrix P must be symmetrical positive dehnite. As the a 
posteriori error e{t) is not known, this algorithm is not physically realizable 
(causality principle). To realize it, we must operate a transformation depending 
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on the known data at t — 1, which is realized by relating the a priori error e^(t) 
and the a posteriori error e(t). The a posteriori error becomes equal to 



e(^) = y{t) - y{t) = y{t) - 4>{t) 



= yW - - i)J 



iT 



(12.54) 



We deduce 



e{t) 



I + (j)^ {t) P (j){t) 



(12.55) 



and the expression of the parametric adaptation algorithm in this improved 
version 



m 



9(t — 1) + 



PHt) 



(12.56) 



The gain matrix P must still be positive definite; on the other hand, we can 
verify that this algorithm is stable for any gain P. To a certain extent, the 
gain matrix P could be chosen in any manner by the user, provided that it is 
positive definite. 




Nonlinear block, with time-varying parameters 



Figure 12.2: Equivalent feedback representation of the adaptive algorithm 
The error of the parameter vector {6 being the true value) is defined by 

9{t) = 9{t) - 9 (12.57) 

giving the a posteriori error 

e{t) = 9 — (jP^ {t) 9{t) = 9{t) (12.58) 

and the adaptation algorithm, with respect to deviation variables, 

9{t) = 9{t-l)^P (j){t) e{t) (12.59) 

The adaptive algorithm defined by Eqs. (12.58) and (12.59) can be represented 
in an equivalent manner as a feedback scheme (Eig. 12.2) (Dugard and Landau, 
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1990), where the signals are deviation variables. The system thus represented 
by the feedback scheme is not a linear system. 

The stability of this algorithm does not mean that the estimated parameter 
vector converges towards the good value. 

Extended Horizon 

The fact that the a posteriori error e^(t) is minimized does not necessarily imply 
that the sum over an extended horizon is minimized; moreover, the 

convergence may be slow and oscillations around the minimum may occur. It 
is clear that the gain must be variable: large at the beginning of minimization, 
then small when getting closer to the minimum. 

Consider the new criterion for an extended horizon based on the sum of the 
a posteriori errors 

t t ^2 

i=l i=l i=l 

(12.60) 

The solution is obtained by making the gradient equal to zero, hence 

= -2^ l^y(i) -y^(t)0(i) 4>{i)=Q (12.61) 

We deduce the estimation of the parameter vector 

t t 

= P{t) (f){i) with: (f)^ (i) (12.62) 

i=l i=l 

This equation thus dehnes the intensity of the displacement in the gradient 
direction corresponding to Eq. (12.53). Notice that the inverse P~^{t) of the 
matrix depends only on the measured outputs. This algorithm is non-recursive 
and obliges us to invert a matrix, which may be numerically delicate. It is 
necessary to wait for a number of steps equal to the dimension of parameter 
vector 9 before starting it. 

12.3 Recursive Algorithms 

12.3.1 Simple Recursive Least Squares 

Frequently, it is desirable to have at our disposal methods that allow us to 
adjust on-line any model in view of the prediction, hltering or control which 
will be qualihed as adaptive. They must display high performance with respect 
to stability, convergence and operation rate, as they must be performed during 
a sampling period. Often, they will also be used off-line. 

In identihcation, the aim of a recursive algorithm is to hnd the new estima- 
tion 6(t) from 6(t — 1) without having to perform again all the calculations. 
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According to the least-squares criterion, we consider again Eq. (12.62) 

t t 

= P{t) (f){i) with: (f)^ (i) (12.63) 



Let us write the sought estimation in a recursive form 



e{t) = Pit) 4>ii) = 0{t - 1) + Aeit) 



(12.64) 



and P ^{t) also in a recursive form 



p ^ it) = '^<pii)<fii) 



P^<Pii) A(*) + <Pit) A(^) 

p-'^it - 1) + 4>it) (f^it) 



(12.65) 



Notice that (jP^ {t) > 0. Thus, as the inverse of the gain increases with time, 
the gain diminishes. 

We then operate the following transformation 



P^yii)<Pii) = P ^it)Kt) = P^y{i)<Pii) +yit)<Pit) 

= p-^it -l)eit -l) Pyit)(pit) 

= p-'^it - 1 ) e{t - 1 ) + yit) 4>it) + 4>it) (p'^it) §it - 1 ) 
-<Pit) 4>'^it)eit - 1) 

= p-^it)eit-i) + 4>it) yit) - e’^ it - 1) 4>it) 

= P~^it) 9it -1) + 4>it)e°it) 



We thus deduce the recurrence for the parameter vector 
0(t) = 0(t - 1) + P(t) (j)(t) 

We already noticed that the gain P(t) is decreasing, as we had 
p-i(t) = p-^it - 1) + 4>it) y (t) 



( 12 . 66 ) 



(12.67) 



( 12 . 68 ) 



During on-line operations, we avoid a matrix inversion because of possible 
numerical difficulties, and we make use of the matrix inversion lemma (11.84) 
with 

A = p-\t - 1) ; B = (pit ) ; C = I ; D = (p'^it) (12.69) 



It results that 



Pit) = Pit - 1) 



pjt-i)(pit) (p'^jt) pjt -1) 
1 + (p’^it) Pit - 1) (pit) 



(12.70) 
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where it appears even more simply that the gain is decreasing. The algorithm of 
recursive least squares (called RLS) based on the a priori error then constitutes 
the set of the three following formulae 



r 0{t) : 

I m 



- 1) + P{t) (t){t) e^{t) 

pj* - - 1) 

1 + p{t - 1 ) (p{t) 
y{t) - 0'^{t - l)(p{t) 



(12.71) 



Formula (12.70) 
the relation 



multiplied on each side of the equality sign by (^(t) gives 



P{t)Ht) 



i + (p’^{t)P{t - 



(12.72) 



and relation 



i + 

(12.73) 

which are useful in deducing the algorithm based on the a posteriori error. 

It is better to use the a posteriori error in order to make a correction; the 
a posteriori error is equal to 



e{t) 



y{t) - e’^{t)(f>{t) 
y(t) 






€°{t) - [P{t) (pit) e°it)Y (pit) 

o,p _ <P'^{t)P{t - 

1 + (p'^it) Pit -l)(pit) 

l+(p'^it)Pit - l)(pit) 



(12.74) 



The algorithm of recursive least squares (RLS) based on the a posteriori error 
then constitutes the set of the three following formulae 



' m 

Pit) 

eit) : 



6it - 1) + Pit - 1) (pit) eit) 

Pj* - 1 ) Pj* - 1 ) 

l+(p'^it)Pit-l)(pit) 

y{t) 



1 + (p'^it) Pit -i)(pit) 



(12.75) 



To start the algorithm of recursive least squares at t = 0, a very large adaptation 
gain P(0) must be chosen; typically, take P(0) = GqI with (To ^ 1. This gain 
P(0) represents a measure of the conhdence of the initial estimation 6^(0). 

The algorithm assumes that the system parameters are constant. 

The algorithm (12. 71) or (12.75) of recursive least squares provides unbiased 
estimations of the parameters only for ARX models in the form 



Mq) yit) =q Biq) uit) + e(t) 



(12.76) 
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where the disturbance is white noise. For this model, the parameter vector 9 
is given by Eq. (11.9) and the observation vector by Eq. (11.14). 
Interpretation According to Kalman Filter 

We can consider the Kalman filter treated in Sect. 11.1.2 as a parameter 
estimator (Dugard and Landau, 1990; Ljung, 1987; Soderstrom and Stoica, 
1989). 

A system linear is written in state space in the form 

x(t + l) = A{t)x{t) ^ ^w{t) n9 77i 

y{t) = C{t)x{t)^v{t) 



where v and w are uncorrelated white noises (of zero mean and respective 
covariance matrices R and Q). 

The linear regression model is 

mo) = <f{t)e (12.78) 

where the parameter vector is assumed constant 

(9(t + l) = 9{t) = 9 (12.79) 

so that the real output is equal to 

y{t) = (j)^ {t) 9{t) + v{t) (12.80) 



In the present case of the estimation of parameter vector 6>, the state is represen- 
ted by the system parameters, and we have x{t) = 9{t) ] A = I (no dynamics 
for the system); 5 = 0; w(t) = 0 (no noise on x): Q = 0 ; x(t) = 9(t)] 
y{t) =y{t + l)-,c = (jX{t ) ; v\t) = v(t + 1) ; F,[v(t)v'^ (t)] = R ; y(t) = y°{t + 1 ). 
The system then becomes 



e{t) = 0{t - 1) 

y(t) = (t) e(t - 1) + v(t) 



(12.81) 



The Kalman filter, an optimal one-step predictor that allows us to estimate 
the state of the system, according to Eq. (11.91), is written 

x{t + l\t) = Ax{t\t -1) + AK{t)[y{t) - y{t\t-l)] (to 

y{t\t-l) = Cx{t\t-1) ^ ’ 



where K{t) is the Kalman gain. The estimation error of the state is equal to 

x{t\t — 1) = x{t) — x{t\t — 1) (12.83) 

and the estimation error covariance is 

E[x{t\t — 1) x"^{t\t — 1)] = P{t) (12.84) 



thus the gain evolution gives an idea of the variation of the estimation error 
covariance. 
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We then seek the Kalman gain K{t) to minimize the estimation error co- 
variance matrix (Eq. (11.91)) 



K*{t) = arg{minxtrace[P(t)]} 

= P{t - 1) [C P{t -1)C^ + RY^ ^ ^ ^ 

By replacing K (t) by its optimal value, we hnd the adaptation gain matrix 

P{t) = AP{t-l)A^+Q-AP{t-l)C'^ [C P{t-l)C'^ + R]~^ CP{t-l)A'^ 

( 12 . 86 ) 

which is a solution of a Riccati equation (see the Riccati equation (11.92) of 
the Kalman hlter as a one-step predictor). 

By applying the Kalman hlter to our model {A = I^C = we thus 

obtain the following recursive equations based on a priori error 



' m 

< p{t) 



6{t - 1) + K{t) e°{t) = 6{t - 1) + P{t) <p{t)e° 
P(f_-\)_ - 1) (Y{t) P(t - 1) 

^ ; R + 4>T(t)P{t-l)4>{t) 

y{t) - {t - 1) <p{t) 



(12.87) 



K{t) is the Kalman gain, which is equal to 

We notice that these equations are the same as for the RLS if R = 1 and 
we assume that Q = 0. If Q were different from zero, the estimation error 
covariance matrix P{t) could not tend towards 0. Given R = 1 and Q = 
0, the parameter vector 6 possesses a Gaussian distribution of mean 9 and 
covariance matrix P{t). It is clear that the initialization of 6 can be provided if 
an estimation is already known and P(0) reffects the confidence that we have 
in this initialization. 

The matrix Q has a similar role to the forgetting factor (Soderstrom and 
Stoica, 1989). When Q is “large” or the forgetting factor low, the algorithm is 
brisk and may follow parametric variations; when Q is “small” (close to 0), or the 
forgetting factor near 1, the algorithm possesses good convergence properties 
(is convenient for time-invariant systems). The matrix Q can be provided by 
the user. 

Different policies of adaptation gain 

The original formula 



P~^{t) = P~^{t — 1) + (j)^ {t) (12.89) 

can be generalized in the form 

P~^(t) = \i(t) P~^(t - 1) + Mit) (j)(t) (j)^(t) (12.90) 



Nevertheless, the parameters Ai and A 2 must fulfil the following conditions: 
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• 0 < Ai < 1, otherwise the system is unstable. 

• 0 < A 2 < 2, so that the gain is decreasing. 

Using the matrix inversion lemma, we obtain the adaptation gain matrix 
according to the following algorithm 



p{t) 



1 

\i{t) 



Pit - 1 ) 



p{t - 1 ) (f){t) (p'^jt) p{t - 1 ) 



(12.91) 



In order to obtain a better condition number of the matrices, it is desirable to 
realize a factorization (examples are Cholesky or Householder). 

The recursive least-squares algorithm is then modified according to the fol- 
lowing equations 



< 



- 1) + P{t - 1) e{t) 
^ y{t) -e{t-l)(l>{t) 
ip4>'^{t)P{t - i)4>{t) 



Pit) 



1 

\i{t) 



Pit - 1 ) 



Pjt - 1) Ht) 4>^it) Pjt - 1) 

^^ + y(t)p(t-i)0(t) 



To start the parametric identification, two cases are possible: 



with: P(0) > 0 
(12.92) 



• No initial information concerning the parameters is available; then, it is 
recommended to choose a high initial gain; the initial matrix is equal to the 
identity matrix times a large scalar Go 



P(0) = GoI , with: Go = 1000 (12.93) 

• Beforehand, a correct parameter estimation has been performed; the gain 
must be chosen small 



P(0) = GoI , with: Go < 1 (12.94) 

or simply equal to the estimation error covariance matrix. The matrix P(0) 
is an image of the accuracy of the initial estimation. 

Many variants bearing on the adaptation gain are possible (Dugard and 
Landau, 1990; Landau, 1990): 



a) Decreasing gain 

Set the two parameters fixed and equal to 1 



Ai = 1 ; A 2 = 1 



(12.95) 
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We get the algorithm of recursive least squares (RLS). The weight is identical 
for all prediction errors in criterion J(t) 

t 2 

J(t) = ^ [y(*) - (12.96) 

i=l 

This algorithm is convenient for identihcation of time-invariant systems and 
self-tuning controllers. 

b) Fixed forgetting factor 

We set 

A2 = 1; Ai(t) = Ai (12.97) 

giving the recursive relation 

= Ai P~^{t — 1) + (j){t) {t) (12.98) 

A good choice is Ai close to 1 (e.g. between 0.95 and 0.99, even between 0.98 
and 0.995). The criterion to be minimized is 

^ r ^ 1 2 

J(t)=y^Ap* y{i) - {t)4>{i) (12.99) 

i=l 

which means that the largest weight is on the last prediction error and that it 
decreases from t. This gain is convenient for identihcation and adaptive control 
of slowly varying systems. The smaller the forgetting factor, the more rapidly 
the information contained in the observation vector is forgotten. 

If the system strictly remains at the steady-state, the observation vector is 
zero: (^(t) =constant, and this leads to an explosion of the adaptation gain. 
Thus, it is necessary from time to time to excite the system or to freeze the 
adaptation gain. 



c) Variable forgetting factor 

We set 



A 2 = 1 


(12.100) 


giving the recursive relation 




p-i(t) = Ai(t) P~^{t - 1) + 4>{t) 4F{t) 


(12.101) 


We can choose 




Ai(t) = Ao Ai(t — 1) + (1 — Ao) 


(12.102) 



with Ai(0) and Aq included between 0.95 and 0.99. Thus, the convergence of 
Ai(t) towards 1 is ensured when t becomes very large (Goodwin and Sin, 1984). 
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The criterion to be minimized is J(t) 



■nt) = E 



j=i 



y{i) - 



t 

with: P Ai(j - 1) = 1 
i=i 



(12.103) 

The variable forgetting factor allows us to forget the beginning of the data, 
which may allow us to adapt to a brutal variation of the system (on one step, 
a small forgetting factor will be taken, then it will be set again near 1). In 
general, the convergence is accelerated as the adaptation gain is large during a 
larger horizon. 

This algorithm is convenient for identification of time-invariant systems and 
self-tuning controllers. 



d) Constant trace 

The significance of a constant trace is a correction in the direction of the least 
squares with increase of the adaptation gain. The trace of matrix P(t) is chosen 
as constant, giving at the initial instant 



P(0) 



Go 0 . . . 0 

0 Go 0 : 



0 ... Go 



The general relation 

p-i(t) = Ai(t) P~^{t - 1) + \2{t) 4>{t) 4F{t) 



(12.104) 



(12.105) 



is used, with variable Xi{t) and X 2 {t) fulfilling the conditions 0 < Ai < 1 and 
0 < A 2 < 2 and, at each step, Xi{t) and X 2 {t) are adjusted so that 



trace of P{t) = nGo (12.106) 

A value of Go included between 0.1 and 4 can be chosen. 

The criterion is 



■m = E 



j=i 



n{i - 1) 







(12.107) 



with 



1 + M{t) (p'^jt) p{t - 1) (p{t) 
1 + p{t -i)(f>{t) 



(12.108) 



This algorithm is very much used and is convenient for identification and adap- 
tive control of variable parameter systems. 

The following algorithms are combinations or simplifications of the variants 
which have been previously described. 
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e) Decreasing gain or constant trace 

We go from the decreasing gain algorithm to the constant trace algorithm when 
the condition 

trace of P(t) < n Go ( 12 . 109 ) 

is verihed with n number of parameters and Go included between 0.1 and 4. 

The decreasing gain or constant trace algorithm is used for identihcation 
and adaptive control of variable parameter systems in the absence of initial 
information. 

f) Variable forgetting factor or constant trace 

We go from the variable forgetting factor algorithm to the constant trace algo- 
rithm when the condition 

trace of P{t) < nGo (12.110) 

is verihed. 

In addition to the previous discussions, the variable forgetting factor or con- 
stant trace algorithm is used for identihcation and adaptive control of variable 
parameter systems in the absence of initial information. 

g/ Constant gain 

Both parameters are hxed and equal to 

Ai = 1; A2 =0 (12.111) 

giving 

P{t) = ... = P{0) = al (12.112) 

In fact, we then obtain the improved gradient algorithm where the adaptation 
gain is scalar. Its advantage is its easy operation and its drawback a worse 
performance than that obtained for the forgetting factor or constant trace al- 
gorithms. It may be used for identihcation and adaptive control of stationary 
or variable parameter systems if the number of parameters of the system is low 
(n < 3). 

h) Scalar adaptation gain 

— The adaptation gain matrix is equal to the identity matrix times a variable 
scalar 

= (mi3) 

— When c{t) is constant, we hnd again the improved gradient algorithm. 

— If we choose c{t) = t, the gain is decreasing (stochastic approximation). 

— It is possible to take the recurrence 



c(t + 1) = c{t) + (j){t) 



(12.114) 




438 



Chapter 12. Parametric Estimation Algorithms 



— or still, 

c(t + 1) = Ai(t) c{t) + \ 2 {t)(jF{t) (j){t) (12.115) 

with the usual conditions 

c(0) > 0 ; 0 < Ai(t) < 1 ; 0 < \ 2 {t) < 2 (12.116) 

— If we choose c(0) = trace of P“^(0), we obtain c(t) = trace of P~^(t). 

The performances of the scalar adaptation gain algorithm are worse than 

those of matrix gain algorithms and counteract their ease of use. 

12.3.2 Recursive Extended Least Squares 

The recursive extended least-squares (RELS) method provides unbiased para- 
meter estimations for ARMAX models in the form 

^{q) y{t) = B{q) u{t) + C{q) e{t) (12.117) 

where we thus simultaneously identify the process and the disturbance. The 
method gives a prediction error which must tend towards the properties of 
white noise. For this model, the parameter vector 6 is given by Eq. (11.26) 
and the observation vector cj) by Eq. (11.25). The policies of adaptation gain 
described by algorithm (12.92) are adapted to this method. 

12.3.3 Recursive Generalized Least Squares 

The method of recursive generalized least squares (RGLS) provides unbiased 
parameter estimations for ARARX models in the form 

Mq) y(t) = q~’^ B{q) u{t) + g(t) (12.118) 

In this case, we introduce an auxiliary variable 

v{t) = A{q) y(t) - q-"^ B{q) u{t) = e{t) (12.119) 

resulting in 

v{t) + di u(t - 1) + . . . + dn,q v{t - Ud) = e{t) (12.120) 

This relation corresponds to an AR model. 

The predictor is introduced 

y{t) = -ai y{t - 1) - . . . - an,, y{t - Ua) + biu{t - 1) + . . . + bn,, u{t - Ub) 
-di v{t -1) - dn,i v{t - Ud) 

(12.121) 

which will give a white prediction error 



y{t) - y{t) = e{t) 



( 12 . 122 ) 
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For this model, the parameter vector 6 is equal to 

e = (12.123) 

and the observation vector cj) is equal to 



4>{i^ ^) = [-y(^-l)> ■ ■ ■ > -y{t-na), u{t-l), u{t-nb), -v{t-l), -v{t-nd)]'^ 

(12.124) 

with 

v(t) = A{q) y(t) - q~^ B{q) u{t) (12.125) 

In this form, it is possible to apply the modihed RLS algorithm (12.92). 

Soderstrom and Stoica (1989) present the RGLS method applied to an 
ARARX model in a different form. By introducing the polynomials 

F{q) = A{q) D{q) ; G{q) = B{q) D{q) (12.126) 

the process model becomes 

F{q) y{t) = q~^ G{q) u{t) + e{t) (12.127) 

which is simply an ARX model to which the RLS, for example, can be applied, 
the parameter vector being equal to 

0 = (12.128) 

and the observation vector (/) being equal to 

= [-y(t - 1), ■ ■ ■ , -y(t - Uf), u(t Ug)f (12.129) 

Notice that the parameter vector does not directly provide the coefficients of 
polynomials A(q) and B(q), but allows us to obtain the system transfer function 
[B{q)D{q)]/[A{q)D{q)]. 

12.3.4 Recursive Maximum Likelihood 

The recursive maximum likelihood (RML) method (Ljung and Soderstrom, 
1986) can be seen (Landau, 1990) as a modification of the RELS method. The 
proposed presentation is that of Ljung (1987). Consider an ARMAX model 

A{q) y(t) = B{q) u(t) + C{q) e(t) (12.130) 

To the observation vector (j)(t) of the RELS method, it makes the vector filtered 
by 1/C{q) correspond (it will have to be verified that the successive estimations 
of C{q) are stable) 

G{q)i){t) = 4>{t) (12.131) 

We introduce the residual 



= yit) - 4A{t)e{t) 



(12.132) 
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As a consequence, the observation vector is equal to 
lit - 1,6»), l{t-nc,9)Y' 



and the parameter vector equal to 



0 [<3-1, . . . , ^1? • • • 7 ^nt,? Cl, ... , 


(12.134) 


The predictor is equal to 




y{t,9) = 4>^{t)9{t - 1) 


(12.135) 


and the prediction error (different from the residual) 




e(^) = y{t) - y{t) 


(12.136) 



The algorithm follows a recursive Gauss-Newton scheme, which thus necessi- 
tates us to initialize sufficiently close to the optimal value 

R{t) = R{t - 1) + {t) - R{t - 1)] 

0(t) = - 1) +7(t)i?“i(t)'0(t)e(t) ^ ‘ ^ 

Frequently, in particular for RPE methods, Ljung and Soderstrom (1986) choose 
7 (t) = 1/t. It is necessary to initialize on a given horizon the RML method in 
order to estimate the polynomial C{q)^ which can be done by a RELS method. 
Landau (1990) recommends that this horizon is at least equal to three times 
the number of parameters to be estimated and to make a transition by go- 
ing from the RELS method to the RML method by replacing q by f3ci with 
0 < /d < 1 (variable contraction factor tending towards 1). On the other hand, 
the stability of polynomial C{q) must be verified. 



12.3.5 Recursive Prediction Error Method 

The recursive prediction error (RPE) method (Ljung and Soderstrom, 1986; 
Soderstrom and Stoica, 1989) is applied to the most general model 



yit) 









D{q) 



which is written again to isolate the noise term as 



A{q)D{q) 

C{q) 



y{t) 



B{q)D{q) 

F{q)C{q) 



u{t) + e{t) 



The best one-step predictor that can be obtained is 



y{t\^) 



A{q)D{q) 

C{q) 



, , Biq) Dig) , , 



(12.138) 



(12.139) 



(12.140) 
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The parameter vector to be estimated is equal to 

0 = [ai, . . . , , 6i, . . . , , /i, . . . , , Cl, . . . , Cfi^^ di, . . . , dfi^] (12.141) 



Contrary to the RLS method, the criterion to be minimized (Ljung, 1987) in 
RPE 






(12.142) 



i=l 



which is a weighted sum of the prediction errors, is not quadratic anymore for 
the general model with respect to 6>, so that the used recursive method can be 
considered as an approached solving method which determines an approxima- 
tion of the parameter vector 0. The algorithm will be a recursive Gauss-Newton 
algorithm. 

Introduce the successive auxiliary variables 



^) = (12.143) 

v(t, e) = A{q) y(t) - w(t, 6) (12.144) 

giving the prediction error 

e{t, 9) = y{t) - y{t\d) = v{t, 6) (12.145) 

In these conditions, the observation vector is equal to 



= [-y{t - 1 )> ■ ■ ■ > -y{t - na), u{t -i), ■■■, u{t- ub), 

—w{t — 1, 6>), . . . , —w{t — rif^ 6>), e{t — 1, 6>), . . . , e{t - 
-v(t -1,6'),..., -v(t - Ud, 9)]'^ 

Also introduce the polynomial 



ric,9), 



(12.146) 



G{q) = C{q) F{q) (12.147) 



The introduced vector 'ip is the opposite of the prediction gradient, which gives 
the search direction 






dy{t) 




dai 

dy{t) 




dbi 

dy{t) 




dfi 

dy(t) 




dci 

dy{t) 




. ddi _ 





D{q) 



y{t - i) 



C(q) 

e{t — i) 






C{q) 



v{t — i) 



(12.148) 
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The algorithm of RPE method is then described by the set of following equa- 
tions (Ljung and Soderstrom, 1986) 

e(^) = y{t) - y{t) 

R{t) = R{t - 1) + 7(t)[V’(t)V’^(^) - R{t - 1)] 

6{t) = 6{t - 1) + 

w{t) = biu{t - 1) + . . . + bn,,u{t - '>^b) - fiw{t - 1) - . . . - fnfW{t - Uf) 
v{t) = y{t) + aiy{t - 1) + . . . + an,^y{t - n^) - w{t) 

l{t) = v{t) + div{t - 1) + . . . + dn,iV{t - Ud) 

-cil{t - 1) - ... - Cn,e{t - lie) 

4>{t + 1) = [~y{t), . . . , -y{t -na + 1), u{t), . . . , it(t - nf, + 1), 

-w{t), . . . , -w{t - n/ + 1), e(t), . . . , e(t - nc + 1), 

. . ., -v{t -nd + 1)]^ 
y{t + l)= e{t)4>{t + l) 

y{t) = y{t) + diy{t - 1 ) + . . . + dnMt - nd) 

-ciy{t - 1) - ... - CnJ{t - Uc) 
u{t) = u{t) + diu{t - 1) + . . . + dn,iU{t - lid) 

-giu{t - 1) - ... - gn,u{t- Ug) 

w{t) = w{t) + diw{t - 1) + . . . + dn,iW{t - Ud) 

-giw{t - 1) - ... - gn,w{t - Ug) 
e{t) = e{t) - cii{t - 1) - . . . - CnJ{t - lie) 
v{t) = v{t) — Civ{t — 1) — ... — Cn^v{t — lie) 

1 p(t + 1) = [-y(t), -y(t -lla + 1), u(t), ...,u(t-llb + 1), 

-w{t), . . . , -w{t - Ilf + 1), e{t), e{t-iic + 1), 

-v(t), -v(t -lld + 1)]^ 

(12.149) 

Notice a difference between this algorithm and the immediately previous equa- 
tions: the prediction error e{t) (a priori) is replaced by the residual e{t) (a 
posteriori). 

By using P(t) = a factor of '0(t)e(t) in the recursive formula 

of 0{t), we notice that it is useful to realize a decomposition of this matrix 
to ensure that it remains positive definite along the iterations. Ljung (1987) 
proposes the Bierman algorithm to replace P by a product UDU^ (U upper 
triangular matrix and D diagonal). The matrix P is initialized as P = PqI 
with large Pq. The forgetting factor y(t) can be chosen, as in RML, to be equal 
to 1/t. 

Along the iterations, a stability test concerning polynomials C and F must 
also be performed. 

As the RPE method is very general, it is interesting to study some particular 
cases: 

• If F{q) = C{q) = D{q) = 1 (ARX model), then we obtain the recursive 
least squares (RLS). 

• If F{q) = D{q) = 1 (ARM AX model), then we obtain the recursive maxi- 
mum likelihood (RML). 
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• If A{q) = C{q) = D{q) = 1 corresponding to the following model 

y{t) = u{t) + v{t) (12.150) 

we obtain an output error method (Dugard and Landau, 1980). 

In general, the recursive prediction error (RPE) method converges faster than 
the recursive extended least squares (RELS) (Soderstrom and Stoica, 1989). 

It must be noted that the extended Kalman hlter, which is applicable to 
nonlinear models, can be used to estimate the parameters of the general model 
(Ljung, 1987). However, the RPE method ensures the convergence contrary to 
the extended Kalman hlter. 



12.3.6 Instrumental Variable Method 



In the previous chapter concerning the principle of instrumental variable method, 
two important properties were stated: 

• The estimator is given by 



^vi 



r iv i“C ^ 

t=i J t=i 



provided that the inverse matrix exists. 

• The instrumental variable vector must verify that 

r E[c(t)0^(t)] is nonsingular 
\ E[at)vo{t)]=0 



(12.151) 



(12.152) 



thus, contrary to the methods based on the minimization of the sum of 
the squares of the prediction errors, the instrumental variable method is 
based on the uncorrelation of the instrumental variables and the regression 
variables. 

The formula (12.151) suggests that it is possible to apply an algorithm that 
is similar to the recursive least-squares algorithm, so that the algorithm of 
the instrumental variable method (Ljung and Soderstrom, 1986) will be the 
following 



' m 

Lit) 

Pit) 



§it - 1) + Lit) [yit) -e’^it- 1) 4>it)] 
p(t)Cit) = ~ Ecjt) 

pit-mt) <p^it)Pit-i) 



Pit - 1) - 



i + <pTit)Pit-i)Cit) 



(12.153) 



This method presents many variants related to the choice of the instrumental 
variables of which a frequent possibility is cited in Sect. 11.4.5. Landau 
(1990) recommends initializing the method by the RLS method over an horizon 
equal to at least three times the number of parameters to be estimated. 




444 



Chapter 12. Parametric Estimation Algorithms 



12.3.7 Output Error Method 

This method, in the same way as the instrumental variable method, is based on 
the uncorrelation of the observations and the prediction errors (Landau, 1990). 
The vectors 0 and cj)(t) are given by Eqs. (11.48) and (11.49). An algorithm 
similar to RLS can be used. As the vector (j)(t) takes into account the model 
outputs instead of the actual outputs, this method is less sensitive to the output 
noise than the RLS method. 



12.4 Algorithm Robustification 

It is not sufficient to have chosen a good parametric adaptation algorithm. 
Some recommendations must be used in view of an efficient on-line use (Dugard 
and Landau, 1990; Isermann, 1991; Landau, 1988, 1990; Middleton and Good- 
win, 1990). 

In the case of RLS, to ensure parameter’s convergence towards their true 
value, the regression vector (j)(t) must fulfil the following condition (Middleton 
and Goodwin, 1990) of sufficient variation 

ti 

There exists t\ and e such that E {%) > el Vt (12.154) 

i=t 

and that the excitation is persistent: the input u must be such that the output 
varies sufficiently; the input is persistently exciting. These conditions are, in 
fact, desired in general. 

• The input and output signals having to possess a zero mean, their steady- 
state component must be eliminated, which is realized by subtracting their 
time mean. 

• An anti-aliasing filter must be incorporated in the system (refer to Shannon 
theorem in signal processing) in order to avoid distortion. 

• The signals in the significant bandwidth must be favoured: 

— Filter the low-frequency signals to avoid the drifts and load disturbances, 
and then work on the filtered signals. 

— Filter the high-frequency signals to eliminate the measurement noise 
and the unmodelled dynamics. 

• Data filtering: 

Suppose that the model before filtering includes a noise as 

yit) = + r](t) (12.155) 



— Gase of coloured noise: 

The coloured noise 77 is defined from its spectral characteristics, as filtered 
white noise e 



v{t) 



cjq) 

D{q) 



e{t) 



(12.156) 




Process Control 



445 



We prefilter the data to form 




D{q) 



(12.157) 



SO that the hltered estimation model simply makes white noise appear 



Vfit) = + e{t) (12.158) 



Then it will suffice to apply a recursive algorithm to the hltered variables 
Hf and <pf. 

— Case of a disturbance: 

In a process, it is necessary hrst to make an analysis of the most fre- 
quent disturbances. Then, the disturbance r]{t) can be modelled using 
a polynomial S{q) depending on the disturbance type (Middleton and 
Goodwin, 1990) such as 

S{q)r]{t) = 0 (12.159) 

A stable polynomial Q{q) is chosen, with degree larger or equal to that 
of /S, and the data are hltered 




Vf = 



S{q) 

Q{q) 



giving the hltered model without disturbance 






(12.160) 



(12.161) 



The polynomial S{q) is the internal model of the disturbance; e.g. for a 
step disturbance, which should always be taken into account 

S{q)=l-q-^ (12.162) 

and for a sinusoidal disturbance 

S{q) = 1 — 2 cos(cjo^s)^~^ + (12.163) 

• Data normalization: 

Consider the following process model 

A{q) y{t) = q-^ B{q) u{t) + w{t) (12.164) 



w(t) represents the unmodelled response of the actual system. We assume 
that this unmodelled part is low in mean with respect to the modelled part. 

To ensure the bounding of w(t)^ the data are normalized with the follow- 
ing normalization factor 

rj{t) = — 1) ^ g max[(^^ (^(t), go] with: 0</i<l; ^>0; go > 0 

(12.165) 
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giving the normalized variables 



“W 

Ht) = ; y{t) = ; (j){t) = 



VW) 



VW) 



VW) 



(12.166) 



and the new model 



A{(l)yit)=q B{q)u{t) +w{t) (12.167) 

hence 

y{t) = 9^ ^{t) ^ w{t) (12.168) 

The adaptation algorithm with the normalized data will be the same as the 
one with the raw data. We obtain 



m 



P{t - e°{t) 

Pit - 1) (pit) e°{t) 

r}it)P4>'^it)Pit-l)(l>it) 



(12.169) 



Pit) 



1 

\xit) 



Pit - 1) 



Pjt - 1) (pjt) (p'^jt) Pjt - 1) 

^ v{t) + <Fit) Pit - 1) 4>it) 



(12.170) 



• Case of time-varying parameter systems: 

In the case of systems with time- varying parameters 6(t)^ there exists no 
stationary value 6. This technique can also be used to take into account 
unmodelled dynamics. 

We use the following prediction of the parameter vector 



9{t) = G 9{t — 1) ^ P{t — 1) (j){t) e{t) with : 0.95 < a < 0.99 (12.171) 

which amounts to replacing the integrator bearing on 9{t) by a first-order 
filter; a is called a contraction factor. 

Let 9{t) be the error committed on the parameter vector (of unknown 
theoretical value 9) 

9{t) =9{t) -9 (12.172) 

With the contraction factor, we obtain the error 



9{t) = 9{t - 1) + P{t - 1) (j){t) e{t) -{1-g)9 (12.173) 

The term (1 — a) 9 constitutes a drift error, which is the counterpart due to 
the better stability. 

• Various techniques: 

Different other techniques exist that allow us to improve the behaviour of 
parametric adaptation algorithms, e.g. 

— The use of the “dead zone”: consisting of freezing the value of the para- 
meter vector when the prediction error is larger than the noise amplitude. 
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— The use of a projection sphere: the parameters are forced to remain 
inside a domain by projection on the boundaries if their estimation is 
situated outside. 

— Factorization of the adaptation gain: the objective is to ensure that the 
gain P{t) is a positive dehnite matrix for all t (Cholesky decomposition, 
etc.). 

12.5 Validation 

In the case of off-line identihcation, the set of input-output data must be sepa- 
rated into two parts: the hrst one being used for identihcation, the second for 
validation. 

The aim of validation is to judge the quality of the identihcation method 
and the quality of the obtained model. The model must present a compromise 
between its htting to input-output data and its complexity, which must not be 
too important (risk of over-parameterization). 

Two classes of identihcation methods have been introduced: 

• Methods based on the hypothesis that the prediction error must have the 
properties of white noise. This is the case of: Recursive Least Squares 
(RLS), Recursive Extended Least Squares (RELS), Recursive Generalized 
Least Squares (RGLS), Recursive Maximum Likelihood (RML), Recursive 
Prediction Error Method (RPE). 

To verify if the prediction error approximates white noise, the autocorre- 
lation of the prediction error is estimated 

1 ^ 

^ (12.174) 

^ k = l 



and normalized 

«,.(.) = ||I (12.175) 

Theoretically, if it were effectively white noise, we would obtain Rn{0) = 
1 and \Rn{i)\ = 0, i > 0. In fact, considering that the autocorrelation 
follows a Gaussian distribution of zero mean and standard deviation 
a statistical criterion at 5% ce-level is that the autocorrelation must then be 
such that 

1.96 

Rn{0) = 1 ; \Rn{i)\ < —j= > 0 (12.176) 

V N 

with: N number of measurements. 

• The methods that are based on the hypothesis that the observation vector 
and the prediction error are not correlated. This is the case of instrumental 
variable and output error methods (Landau, 1990; Ljung, 1987; Ljung and 
Soderstrom, 1986; Soderstrom and Stoica, 1989). 
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12.6 Input Sequences for Identification 

To obtain the convergence of the parameter vector of the model towards the 
true value, which is to realize good identification, it is important to choose an 
input sufficiently rich in frequencies. A type of input frequently used for its 
richness is the pseudo random binary sequence (PRBS) (Landau, 1990), which 
is formed by rectangular impulses of variable duration and of amplitude that is 
alternatively +1 or — 1 , the mean amplitude for the full sequence is practically 
zero, which allows us to limit the amplitude of the disturbance introduced on 
the process during identification. Other types of inputs can be used, such as 
rectangular impulses of variable length and amplitude, allowing us to easily 
detect non-linearities. A PRBS-type input approximates white noise (recall 
that white noise is characterized by a constant spectral density). In the case 
of the PRBS, the spectral density will be practically constant until about 0.3 
times the sampling frequency /e. 

A program generating a PRBS can be realized according to the scheme in 
Fig. 12.3. 



Bo 



Bi 


B2 


Bs 


B4 


r 


+ 

+ 





B 



Addition Modulo 2 



Figure 12.3: Generation of a pseudo random binary sequence for = 4 

If an integer N is chosen, the maximum length of the PRBS is 2 ^ — 1 (then 
the sequence is periodical). 

In Fig. 12.3, each box or register Bi represents a bit (value 0 or 1). The 
PRBS is represented by the values of the output B equal to B/^ along the 
iterations. The addition modulo 2 of B/^ and Bs gives Bq equal to Bi (if B/^ 
and Bs are both equal to 0 or 1 , is 0 ; if B 4 and Bs have different values, Bq is 
1). To go from an iteration to the following iteration, it suffices to translate the 
values in the registers: Bi takes the value of Bq which has just been calculated; 
B 2 takes the value of 5i, . . ., ^ 4 , thus B takes the value of ^ 4 . In the case of 
Fig. 12.3, N = A was chosen, which corresponds to a maximum length of the 
sequence equal to 15. 

Table 12.1 gives the characteristic polynomials of the PRBS, which allows 
us to build schemes such as Fig. 12.3 for different lengths of PRBS. In the case 
where = 4, the addition modulo 2 thus gives 

[i?** 0 0 R°]Bi = 0 (12.177) 

where R is a backward shift operator such that 

WBi = Bi^j 



(12.178) 
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Table 12.1: Characteristic polynomials of PRBS of maximum length 



N 


Characteristic 

polynomial 


Maximum 

length 


Output periodic sequence 


2 


r:^(br^(b r^ 


3 


no 


3 


R^ (BR'^ (B R° 


7 


1110010 


4 


R‘^(BR^(B R° 


15 


111100010011010 


5 


R^ ®R^ ® R° 


31 


1111100011011101010000100101100 


6 


R^(BR^(B R° 


63 


11111100000100001100010100. . . 



The addition rule (12.177) can be written in an equivalent manner 

Bi = R‘^ Bi ® Bi (12.179) 

thus 

R^Bo=Bz®Bi=R^[Bi®B3] (12.180) 

hence 

= ^4 0^3 (12.181) 

which indeed corresponds to the concerned scheme in Fig. 12.3. 

If we begin by the sequence 1111 (each register contains the value 1), the 
output of the obtained sequence of maximum length with N = A will be 

nil 000 1 00 11 0 1 0 (12.182) 

and the PRBS would be (replacing values 0 by —1) 

nil -1-1-11 -1-111 -11 -1 (12.183) 

Let Tg be the sampling period. To allow correct identihcation, at least 

one of the rectangular pulses must have a length larger than the rise time r 
characteristic of the process. As the maximum length of a rectangular pulse of 
the PRBS is N sampling periods, the following rule results 

NTs>r (12.184) 

On the other hand, the duration of the identihcation tid must, of course, 
be larger than the duration of the PRBS to cover all the frequency spectrum 
generated by the PRBS, thus a second rule results 

tid>{2^-l)Ts (12.185) 

As the hrst rule could end at too high values of N leading to an excessive 
duration of the PRBS, often an under- multiple of the sampling frequency is 
used as the basis frequency for the PRBS 

i^PRBS = — withip = 1 , 2 ,... 

P 



(12.186) 
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giving the new rule, replacing the first rule 

pNTs>r (12.187) 

By using p > 1, we reduce the frequency spectrum of the PRBS where the spec- 
tral density is approximately constant. In general, this is not too important, 
provided the process has a low bandwidth. 

12.7 Identification Examples 

12.7.1 Academic Example of a Second-Order System 

Example 12.3: Identification of a Second-Order System 

To display some identification characteristics, first consider a continuous sys- 
tem, of transfer function 



- 9s2 + 3s + 1 



(12.188) 



which is underdamped second-order. This transfer function has been discretized 
by a zero-order holder with a sampling period Tg = 0.5, giving the discrete 
transfer function 



B{q) _ 0.06559^ + 0.06204 

A{q) ~ q^ - 1.8210^ + 0.8465 



(12.189) 



The response to a unit step is represented in Fig. 12.4. Now, we will only 
consider this discrete system. 




Figure 12.4: Response of a continuous second-order system and the discretized 
system to a unit step input 

An ARMAX model corresponding to this transfer function B{q)/A{q) was built 
by adding a noise term C{q) 

y{t) - l.S210y{t - 1) + 0.8465^(t - 2) = 0.06559ix(t - 1) + 0.06204ix(t - 2) 

+e(t) - l.le(t - 1) + 0.3e(t - 2) 



(12.190) 
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where e{t) is Gaussian white noise of hxed standard deviation erg (here, erg = !)• 
This model was subjected to a pseudo random binary sequence (PRBS) 
of characteristic number = 5, whose base period was taken to be equal 
to four times the sampling period. This model will be subsequently called 
simulated or pseudo-experimental system. The input and the output of the 
simulated system are represented with respect to normalized time (one unit = 
one sampling period) in Fig. 12.5. 





Normalized Time 



Figure 12.5: Pseudo random binary sequence (top) and response (bottom) of 
the discrete system to this PRBS 

The input-output data thus obtained were hrst identihed by a recursive ex- 
tended least-squares method with an ARMAX model with the same number of 
parameters by using a constant forgetting factor A = 0.99. 




Normalized Time 



Figure 12.6: Evolution of the parameters of polynomials A{q) and B{q) of 
the identihed ARMAX model (RELS method) in the case where cTg = 1 
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After a transient period, the parameters converge rather quickly (Fig. 12.6). 
The output obtained from the identified model can be compared to the simu- 
lated experimental output in Fig. 12.7. In spite of the important noise which 
concerned the simulated system, the dynamics of the identified model is near 
that of the simulated system. The prediction error (Fig. 12.8) was calculated 
and we must verify that it presents properties close to white noise. The au- 
tocorrelation of the prediction error (Fig. 12.9) is included in the confidence 
interval at 5% ce- level. 

8 
6 



O 



-4 
-6 
-8 

O 20 40 60 80 lOO 120 
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y predicted 
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Figure 12.7: Comparison of the predicted output and the experimental output 
(RELS) in the case of a standard deviation of the simulation model ae = 1 



Several criteria have been calculated to characterize the identification per- 
formance: 

— The variance of the prediction error Ci = . 

— Akaike’s criterion of theoretical information (Soderstrom and Stoica, 1989): 

C 2 = JVlog((7^) + 2dm (12.191) 



— Akaike’s criterion of final prediction error (Ljung, 1987) equal to 



C3 



1 + dm/N 2 

1 - 



(12.192) 



where dm is the model dimension, thus the number of parameters to be 
determined. 

The evolution of these criteria was studied with respect to: 

— The standard deviation ae of the noise e{t) used to generate the simulated 
system outputs. Table 12.2 clearly shows that, before realizing identifi- 
cation, it is useful to filter the data or even better still, to realize direct 
filtering of the sensors of the system. 

The values of the final parameters (the length of the input sequence is only 
equal to 124) obtained for the model identified by the RELS method are 
given for these different values of erg, as well as for the RML and RPE 
methods. Of course, the smaller the noise, the closer the final parameters of 
A{q) and B{q) are to their theoretical value. Even when the final parameters 
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Normalized Time 



Figure 12.8: Prediction error (RELS) in the case where Gq = 1 



Identification 




Normalized Time 



Figure 12.9: Autocorrelation of the prediction error (RELS) in the case where 

CTe = 1 



Table 12.2: Influence of the standard deviation of the error of the ARM AX 
model on the identihcation result expressed by criteria Ci 



RELS method (with A = 0.99) 


(7e = 1 


CTe = 0.5 


CTe = 0.25 


(7e = 0.10 


Cl 


0.912 


0.269 


0.0830 


0.0138 


C 2 


8.531 


-143 


-289 


-511.2 


C 3 


1.072 


0.317 


0.0975 


0.0162 



are far from their theoretical value, the dynamics of the identihed model is 
correct. On the other hand, of course, the predicted output is much closer 
to the experimental output (Eig. 12.11) when the noise is smaller. 

— The value of the constant forgetting factor A (Table 12.4): the increase 
of A beyond 0.99 brings little improvement, on the other hand its decrease 
leads to a neat deterioration. 

On the other hand, different versions of the RPE method taking into account 
different forgetting factors or different decompositions of the matrix P, have 
given results rather close to previous RPE method. Eigure 12.10 shows the 
predicted output by means of the model identihed by the RPE method in the 
case where = 1. 

To improve the estimation results, it is possible to proceed in several runs: after 
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Table 12.3: Values of the parameters of the simulated model and identified 
models. Value of the prediction error criterion 





ai 


CL2 


bi 


h2 


Criterion 

Cl 


Theoretical 

parameters 


-1.8210 


0.8465 


0.06559 


0.06020 




RELS (cTe = 1) 


-0.91552 


-0.02954 


-0.18114 


0.65439 


0.912 


RML (cTe = 1) 


-0.89830 


-0.04031 


-0.18298 


0.66012 


0.913 


RPE be = 1) 


-1.05677 


0.10273 


-0.29681 


0.80870 


0.985 


RELS (cTe = 0.5) 


-1.18897 


0.23026 


-0.04587 


0.37495 


0.269 


RELS (cTe = 0.25) 


-1.50796 


0.54010 


0.02823 


0.19856 


0.0830 


RELS be = 0.10) 


-1.77681 


0.80251 


0.05939 


0.08889 


0.0138 


RELS (cTe = 1) 
with 20 runs 


-1.38486 


0.42463 


-0.12837 


0.47795 


0.8564 



Table 12.4: Infiuence of the forgetting factor in the RELS method on the 
identification result 



RELS method (with = 1 ) 


A = 0.95 


A = 0.99 


A = 0.995 


Cl 


1.014 


0.912 


0.910 


C2 


21.68 


8.531 


8.279 


C3 


1.191 


1.072 


1.069 



having used the data set for a first parametric estimation, the last estimations 
are kept and used as initialization for a new estimation run. In this manner, 
the recursive identification approaches off-line identification (Ljung and Soder- 
strom, 1986). This method was applied to a set of initial small size {N = 124) 
which had already been used and resulted in an improvement of the criterion Ci 
from 0.9117 to 0.856 and the parameters (Table 12.3) in a significant manner 
(Fig. 12.12). 
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Figure 12.10: Comparison of the predicted output and of the experimental 
output (RPE method) in the case where de = 1 




Normalized Time 



Figure 12.11: Comparison of the predicted output and the experimental out- 
put (RELS method) in the case where erg = 0-25 




Normalized Time 



Figure 12.12: Evolution by several runs of the parameters of polynomials 
A(q) and B(q) of the identihed ARM AX model (RELS method) in the case 
where de = 1 
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12.7.2 Identification of a Simulated Chemical Reactor 

Example 12.4: Identification of a Chemical Reactor 




A chemical reactor with a jacket was 
simulated by means of a nonlinear 
knowledge model (Sect. 19.2). The che- 
mical reaction is considered as a distur- 
bance and only the thermal behaviour 
of the reactor is analyzed. 



Concerning control, the input is the position of a three-way valve which drives 
the temperature of the heating-cooling fluid entering in the jacket, and the 
output is the temperature of the reactor contents. This corresponds to a lab 
pilot reactor. A pseudo-random binary sequence is imposed on the input after 
the steady-state has been reached for an input equal to the mean amplitude of 
the PRBS. The base period of the PRBS is 250 s and the sampling period is 5 s. 
The first model identified by the RELS method with a forgetting factor equal 
to 0.99 is an ARMAX model with degrees of A{q), B{q),C{q) all equal to 1, 
thus only one coefficient was identified for each polynomial. We notice that the 
thermal dynamics of the reactor is very correctly respected (Fig. 12.13). If the 
degree of all polynomials is increased to 2, the difference becomes very small 
and non-perceptible in the figure. The reactor has indeed a behaviour close 
to a first-order system. An ARX model identified by RLS also gives nearly 
identical results. 

The same knowledge model for the reactor was used, but the output tempe- 
rature was polluted by realistic Gaussian white noise of standard deviation 0.5. 
The identification was performed in a parallel manner. In this case, with an 
identical number of parameters and n^, the ARX model gives slightly less 
satisfactory results than the ARMAX model. On the other hand, the ARMAX 
model (denoted by ARMAX 2-2) with the degrees of A{q)^ B{q)^ ^{q) equal 
to 2 is very slightly worse than the ARMAX model (denoted by ARMAX 1-1) 
with the degrees of A{q), B{q),C{q) equal to 1 (Fig. 12.14). This is confirmed 
by the values of the criteria obtained for the reactor with noise, gathered in 
Table 12.5. All models, in the cases without and with noise, have in common 
possession of a pole z ~ 0.976. 
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Figure 12.13: Comparison of the temperature predicted according to an AR- 
MAX model identified by the RELS method and of the experimental tempera- 
ture of the reactor contents (case without noise) 




Figure 12.14: Comparison of the temperature predicted from an ARM AX 
model identified by the RELS method and of the experimental temperature of 
the reactor contents (case of the output with noise). The line of the ARM AX 
1-1 model is slightly closer to the experimental values than the line of the 
ARMAX 2-2 model 



Table 12.5: Values of the identification criteria for a reactor with output 
noise 



Reactor with noise 


ARX 1-1 


ARX 2-2 


ARMAX 1-1 


ARMAX 2-2 


Cl 


0.497 


0.391 


0.290 


0.335 


C2 


-1009 


-1350 


-1784 


-1565 


C3 


0.499 


0.394 


0.292 


0.340 
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Part IV 

Discrete Time Control 




Chapter 13 

Digital Control 



This chapter mainly concerns pole-placement control in the case of discrete 
time. By its general character, pole-placement control is very important; in- 
deed, it can cover other specialized types of control such as discrete BID, linear 
quadratic control studied in Chap. 14 in relation to optimal control, genera- 
lized predictive control studied in Chap. 15, model predictive control studied 
in Chap. 16. Discrete internal model control will also be examined, as well as 
general characters of adaptive control. 



13.1 Pole-Placement Control 

13.1.1 Influence of Pole Position 

In continuous time, the influence of the position of the poles of the closed- 
loop transfer function on the overshoot, the rise time and the settling time 
was studied (Fig. 4.2). Using the correspondence z = exp(5T5), it is possible 
to obtain in discrete time an analogous hgure in the unit circle of the complex 
plane. The drawing in Fig. 13.1 was realized with the following values: ( = 0.7, 
= 1, Tg = 1. Notice that the fast poles correspond to z ^ 1 with the 
integrator pole z = 1. The results in this Figure can be compared with the 
step responses in Fig. 9.21 with respect to the pole position. 

13.1.2 Control Synthesis by Pole-Placement 

Pole-placement control was already studied in the continuous case in Chap. 
4. As Astrom and Wittenmark (1989) mentions, model reference control is 
considered as a particular case of pole-placement and is considered in the same 
framework. 

The pole-placement controller or RST controller, is described by the cano- 
nical structure in Fig. 13.2. It can be used for unstable or stable systems as 
well. 
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Figure 13.1: Loci of constant overshoot, rise time and settling time in the 
complex plane in discrete time. The dashed zone is not convenient for the 
poles 



It is possible to present pole-placement with the forward shift operator q (As- 
trom and Wittenmark, 1989) or with the backward shift operator q~^ (Landau, 
1990), and also with the S operator (Middleton and Goodwin, 1990). This last 
choice introduces a better numerical robustness. The choice of operator q or 
q~^ influences, in particular, the solving of the Bezout equation (De Larminat, 
1993). 

In this introductory part, in order to avoid problems due to the use of 
the operator q~^^ the operator q will be used. Moreover, the parallel with 
continuous pole-placement (Sect. 4.9) becomes still more obvious. 

The process transfer function can be represented as 



H{q) 



yjt) 

u{t) 



B{q) 

A{q) 



(13.1) 



— q^ + aiq^ ^ + . . . + an-iq + a,n 
= hiq^ ^ + . . . + hn-iq + bn 



with 



A{q) 

B{q) 



(13.2) 
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Figure 13.2: Pole-placement or RST control 



so that 

y{t) _ biq^~^ + . . . + bn-iq + bn _ + . . . + 

u{t) q'^ + aiq'^-^ + . . . + an-iq + an 1 + + . . . + + a„g“" 

(13.3) 

The transfer function H{q) is strictly proper: deg A > deg 5. The coefficient bo 
is zero to emphasize that the output is always delayed by at least one sampling 
period with respect to the input. The system is causal. There always exists a 
delay equal to at least one sampling period; if, moreover, the system presents an 
effective delay d, the d following coefficients 5i, 62 , • • • are zero. The polynomial 
A{q) is monic (uq = !)• The corresponding difference equation is 

y{t) + aiy{t - 1) + . . . + any{t - n) = biu{t - 1) + . . . + bnu(t - n) (13.4) 



The polynomials B{q) and A{q) are assumed to be coprime. 

Note in the same manner the polynomials R{q)^ S{q), T{q) 

R{q) = roq"^ + riq^~^ + . . . + Vm-iQ + Tm 

^{q) = + siq^ ^ + . . . + Sm-iq + Sm (13.5) 

^(^) = Toq^ + . . . + Tjn-iq + 



so that the controller is proper: deg S > deg R and deg S > deg T. According 
to the block diagram in Fig. 13.2, the control law is 

where yr{t) is the reference. 

The closed-loop output results 



y{t) 



y (t) + 

A{q)S{q)+B{q)R{q)y^^>^ 

A{q)S{q) + B{q)R{q) ^ 



M<l)S{q) 

A{q)S{q) + B{q)R{q) 



d{t) 



(13.7) 




466 



Chapter 13. Digital Control 



The pole-placement means that the closed-loop poles are specified, corres- 
ponding to the zeros of the polynomial P{q) defined by the Bezout equation 

P{q) = A{q)S{q) + B{q)R{q) (13.8) 

with 

P{q) = PoqP + Pi/-' + . . . + Pp-iq + Pp (13.9) 

From the equality of the degrees of P and the product AS^ we deduce p = 
n-\-m. We reason as if were unknown. The number of equations corresponds 
simultaneously to the number of unknown coefficients of polynomials R{q) and 
S{q), and to the number of specified coefficients of polynomial P{q). So that the 
solution is unique, we impose p-\-l = 2m + 2, hence p = 2n — l,m = n — 1, thus 
for a simply proper controller deg P = 2deg A—1 and deg S = deg R = deg A—1. 

Given the polynomial P{q) and a proper controller, the solution of the 
Bezout Eq. (13.8) gives the polynomials R{q) and S{q) by solving the following 
system 




n columns n columns 

(13.11) 

where R{q) and S{q) follow Eq. (13.5) with m = n — 1. In fact, in the previous 
system uq = 1, = 1 and Pq = 1. The matrix to be inverted during the solving 

is a Sylvester matrix of dimension (2n) x (2n), for this proper controller. 
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So that the Sylvester matrix is nonsingular, thus invertible, it is necessary and 
sufficient that the polynomials A{q) and B{q) are coprime. 

A strictly proper controller (deg S = degR-\- 1) offers better high-frequency 
behaviour. It must verify deg S = deg A, deg R = deg A—1 and deg P = 2deg A. 
In the case of a strictly proper controller, the calculation is performed in a very 
similar manner with the Sylvester matrix of dimension (2n + 1) x (2n + 1), and 
coefficient tq = 0 as follows 




Again, in this system, ao = 1, = 1 and Pq = 1. R{q) and S{q) are given by 



R[q) =nq^ 2 + . . . + r„ 

= </” + ^ + . . . + Sn-iq + Sn 



with 



ao 0 

ai ao 



0 



0 



0 

0 

bi 0 




C-n — 1 






ao 


0 


an 


C-n — 1 






ao 


0 


an 


dfl-l 




ai 






an 







bn-2 ... bi 0 0 

bn-1 bn-2 ... bi 0 



bn bn-1 ... bi 

0 bn ... b2 




(13.14) 



It is more customary to use the polynomials with the operator q~^ rather 
than q] note that Astrom and Wittenmark (1989) and Middleton and Goodwin 
(1990) use the operator q and that De Larminat (1993) even recommends its 
use, although it is not causal (as well as the Laplace variable 5 ). For a transfer 
function whose numerator and denominator are formally polynomials of equal 
degree in q and q~^^ the representation is equivalent. 
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If we had used the polynomials with the backward shift operator q we 
would have found a solution to the Bezout equation 



P{q-^) = A{q-^)S{q-^) + B{q-^)R{q~^) (13.15) 



for a polynomial P{q~^) of any degree, which is not the case for Eq. (13.8). 
In fact, it is not desirable that P{q~^) is of any degree, as a too low degree of 
P{q~^) with respect to the previous conditions leads to implicit placement of 
poles at the origin. If on one hand the degrees of A{q~^) and of B{q~^) are 
identical and on the other hand those of S{q~^) and R{q~^) are also identical, 
the solving of the Bezout equation in q or in q~^ is equivalent. 

In the case where the operator q~^ is used, the system is modelled by 



) = -TTV = = g 



u{t) A{q-^) 
with the delay d and polynomials 



A{q-^) 



— 1 aiq ^ Gn^-iq P ctnaQ. 

B\q-^) = 6 ' + . . . + q-< 

b b 

The control law is 

S{q~^)u{t) = T{q~^)yr{t) - R{q~^)y{t) 



(13.16) 



(13.17) 



(13.18) 



where the polynomials R{q and S{q are obtained by solving the Bezout 
equation 

P{q~^) = A{q-^) S{q-^) + q-^~^ B\q-^) R{q~^) (13.19) 

and the closed-loop output is equal to 



y{t)= 



-d-1 



B\q-^)T{q-^) 



A{q-^) S{q-^) + q-^-^ B'{q~^) R{q~^) 



yr{t)=^ 



-d-l 



B'{q-^)T{q-^) 



P{q-^) 



Vrit) 



(13.20) 



Example 13.1: Comparison Between q and q ^ Models 

Consider the transfer function expressed by q 



yd) ^ , B{q) ^ + Og + 0.5 

u{t) A{q) q^-0.5q + 0A 



(13.21) 



Formally, A{q) and B{q) have same degree 2. However, the first two coefficients 
of B{q) are zero, thus the delay of y{t) with respect to u{t) is two sampling 
periods, including the normal delay of one sampling period of the output with 
respect to the input. The following transfer function in q~^ is equivalent 



yjt) 

u{t) 






= q~ 



1 -0.5g-i +0.3g 



O + Og-i +0.5g-2 
1 -0.5g-i +0.3g-2 



-2 



(13.22) 
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where, besides the normal delay, there exists an effective delay d = 1 of one 
period. 

In a general way, if we want to use the same coefficients in polynomials A 
and B whatever they are in q or in they must be written as 



p(t) = 



A(q-^) 



u(t) y(t) = 



B(q) q- 



nnb 



A{q) 



u(t) = q^ 



;-nbB{q) 

A{q) 



u(t) (13.23) 



where ria is the degree of polynomial A, of polynomial B. 



Regulation Behaviour 

In order to guarantee a zero steady- state error, in general, an integrator is 
included in the controller so that the polynomial S{q) is replaced by Hi{q)S{q) 
with Hi{q) = q — 1. The addition of an integrator in the controller aims to 
reiect the constant disturbances in agreement with the internal model principle 
(Sect. 5.9). 

On the other hand, a robustness filter is also added in the feedback loop 
(Fig. 13.3), transforming polynomial R{q) into H 2 {q)R{q) with 

H2{q) = ; 0 < a < 1 (13.24) 

r — a 

In these conditions, the Bezout equation becomes 

P{q) = A{q) H^{q) S{q) + B{q) H 2 {q) R{q) (13.25) 




Figure 13.3: Pole-placement control with integrator and robustness filter 



Tracking Behaviour 

The reference trajectory yref{t) is either the set point yr{t) or defined with 
respect to a reference model (Fig. 13.4) 



Bmjq) 

Am ((/) 



Hm{q) 



(13.26) 
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yr{t) 


Bm{q) 


Vrefyi + d + 1) 


T{q) 






Am {q) 







e{t) 



Um (t) 



Figure 13.4: Reference trajectory and set point 



that 



yrefit)=q-^-^^^yr{t) 



Urefit +d + l) — 



^ An^iq)' 

in order to take into account the delay d + 1, with the polynomials 



nm—l 



Am{q) + 

Bm{q) = + bmiq 






,nm—l 



bn 



(13.27) 



(13.28) 



The polynomials and can be obtained from a continuous transfer func- 
tion, e.g. derived from the ITAE criterion, then discretize, for example, by a 
zero-order holder or the Tustin transformation. 

If we separate the polynomial T{q) from the reference model BmjAm as in 
Fig. 13.4, to guarantee a unit gain from Eq. (13.7), we impose 



^ ’ \ P{z) if: B{1) = 0 



(13.29) 



Astrom and Wittenmark (1989) integrate the reference model BmjAm in 
the polynomial T{q)] in these conditions, for the gain to be 1, we need 



f B{z)T{z) Bm{z) \ 
\ P{z)A^{z) 



(13.30) 



13.1.3 Relation between Pole-Placement and State Feed- 
back 

In this section, where we insist on causal relations and where the objective is to 
establish an analogy between pole-placement and state feedback, the operator 
q~^ is used. According to Fig. 13.2, we introduce the partial state z(t) as 

A{q~^)z(t) = u(t) ; y(t) = B{q~^)z(t) (13.31) 

It is possible to factorize the pole-placement polynomial P{q~^) as 

P{q-^) = A,{q-^)[A{q-^) + K{q~^)] (13.32) 

with the polynomial K{q~^) being of degree n such that 



K{q-^)=kiq-^ + ... + kn-iq-^+^ 



+ knq ” 



(13.33) 
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and Ao{q being of degree n — 1. We then define a polynomial P 2 {q such 
that 

P2{q~^) = S{q~^) - Ao{q~^) (13.34) 

Using Eqs. (13.18) and (13.27), we deduce 

[Ao{q~^) + P2{q~^)]u{t) = T{q~^)yr{t + d + 1) - R{q~^)y{t) (13.35) 

By introducing the partial state z(t), we have 
Ao{q~^)u{t) = T{q~^)yr{t + d + 1) - [P2{(i~^)A{(i~^) + R{q~^)B{(i~^)]z{t) 



P 2 {q-^)A{q-^) + R{q-^)B{q-^) = [S{q~^) - Ao{q-^)]A{q~^) + R{q-^)B{q~^) 

= P{q-^) - A,{q-^{q-^) 

= A,{q-^)K{q-^) 

(13.37) 

we obtain 

[P 2 {q-^)A{q-^) + R{q-^)B{q-^)]z{t) = Ao{q-^)K{q-^)z{t) 

= P 2 {q ^)u{t)+R{q ^)y{t) 

(13.38) 

and 



Ao{q ^)u{t)=T{q ^)yr{t + d + 1) - Ao{q ^)K{q ^)z{t) 
u{t) = + d + l)~ K{q~^)z{t) 



(13.39) 



Equations (13.38) and (13.39) correspond to the scheme in Eig. 13.5. As the 
polynomial Ao{q~^) allows us to estimate the partial state vector z{t) according 
to Eq. (13.38), it is called the observer polynomial and specifies the observer 
dynamics. 

The closed-loop equation of the system can then be formulated as 



/ Ao{q ^)[A(^ ^)+K{q ^)]z{t)=T{q ^)yr{t + d + 1) 

I y{t) = B{q~'-)z{t) 



Indeed, we verify that the denominator of the closed-loop transfer function is 
equal to the pole-placement polynomial 

A,{q-^)[A{q-^) + K{q-^)] = P{q~^) (13.41) 



The system can also be expressed in state-space equations. By setting the 
state vector composed by the partial states z(t), according to Eq. (13.31) 

x(t) = [z(t — 1) . . . z(t — n)]^ 



(13.42) 
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Process 




Figure 13.5: Pole-placement representation with partial state z{t) and the 
observer polynomial Ao{q~^) 



Process 




Figure 13.6: State-space representation of pole-placement with partial state 
z{t) and the observer polynomial Ao{q~^) 



the model is written in the controllable form as 





— ai 


dn 








1 0 


0 




1 ' 
0 


x(t “h 1) — 


0 : 

: 0 


x(t) + 


_ 0 _ 




0 


0 1 0 J 







y{t) = [6i . . . bn] x{t) 



u{t) 



(13.43) 




Process Control 



473 



By setting as a state feedback gain vector equal to 

K=[ki ... kn] (13.44) 

Eq. (13.38) can be seen in the form 

Ao{q~'^)K{q~'^)z{t) = Ao{q~'^)Kx{t) = P 2 {q~'^)u{t) + R{q~'^)y{t) (13.45) 

assuming that all the states x(t) are measurable. The equation expressing the 
state feedback (13.39) is then 



u{t) = + d + 1) - Kx{t) 

By using the state feedback (13.46), the closed-loop model becomes 



(13.46) 



x(t T 1) — 



y{t) = [h... bn] x{t) 
by setting 



ci\ k\ ... Qjfi kfi 






1 0 ... 0 




1 






0 


0 : 


x(t) T 




: 0 




0 


_ 0 0 10 





We deduce the characteristic polynomial 

z'^ + (ai + k\)z^ ^ + . . . + (utt, + kfi) 



(t + d + 1) 

(13.47) 

(13.48) 

(13.49) 



which shows that the closed-loop poles can be specihed by means of coefficients 
ki. 

In the common case where the states x{t) are not all measurable, it is 
necessary to use an observer to estimate the states x{t) (Goodwin and Sin, 1984; 
Vidyasagar, 1985). The system is described in state space in the observable 
form and with gain L of the observer 





—ai 1 0 

: 0 ■ 


.. O' 




' hi 


Xo{t + 1) — 




•. 0 








b • • • 


•. 1 

.. 0 _ 




_ hfi _ 



u(t) 



(13.50) 



+ 



h 



{y{t) - [10. . .0]fo(^)) 
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It is possible to go from the states x to the states Xq by a transformation matrix 
P: Xo{t) = Px(t)^ so that we obtain the estimation of the states 





—ai — li 1 0 


.. O' 






x{t + 1) = P ^ 


: 0 


•. 0 


Px{t)+p-^ 


■ ■ 






1 








b • • • 


.. 0 _ 




_ _ 



h 






yit) 



(13.51) 

In Eq. (13.46), the states x(t) are at last replaced by their estimation x(t). 
The observer and the state feedback are displayed in Fig. 13.6. 



13.1.4 General Pole-Placement Design 

The choice of polynomial P{q) determines the type of pole-placement obtained. 
The polynomial P{z) can thus be chosen by spectral factorization (Astrom and 
Wittenmark, 1989) from a quadratic criterion 

pOO 

J = / + pu{t)‘^dt (13.52) 

Jo 

from which we draw 

pA{z)A%z)^z^B{z)B^{z) =pP{z)P%z) (13.53) 

where A*(z) is the polynomial obtained from A{z) by replacing z by 1/z. In 
this case, the parameter p alone is sufficient to specify P{z). 

It is also possible to follow the same procedure as for continuous pole- 
placement (Astrom and Wittenmark, 1989); the presentation is realized in a 
general framework before examining particular cases. 

On one hand, if we want to take into account the reference model and if we 
integrate the model Bm{q)/Am{q) in T{q), it is necessary that the polynomial 
Am{q) is a divider of P{q). 

According to Eq. (13.7), the process zeros which are those of B{q) are also 
those of the closed-loop transfer function, except if common roots exist between 
B{q) and P{q). Unstable or weakly damped zeros cannot be eliminated. The 
polynomial B{q) is then factorized as 

B{q)=B+{q)B-{q) (13.54) 

where B^{q) is a polynomial containing all stable and well-damped zeros, which 
can therefore be eliminated, and B~ (q) is a polynomial containing all other 
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zeros. B^{q) is chosen to be monic to make the factorization unique (Astrom, 
1980). 

Lastly, the polynomial P{q) is factorized as 

P{q) = A^{q) B+{q) Ao{q) (13.55) 

where Ao{q) is the polynomial giving the observer dynamics. 

The Bezout equation is then written in the general form 

M<l) S{q) + B{q) R{q) = A^iq) B+ (q) Ao{q) (13.56) 

We deduce that B~^{q) must be a divider of S{q), so that we can set 

S{q)=S'{q)B+{q) (13.57) 



from which results the simplihed Bezout equation which gives R{q) and S'{q) 
A{q) S'{q) + B~ (q) R{q) = Am{q) Ao{q) (13.58) 

and the expression of the closed-loop output 



y{t) 



B-{q)T{q) 
Amisi) ^o{q) 



yr{t) 



(13.59) 



In order to have the same output as that specihed by the reference model, it is 
necessary that 



B-{q)T{q) 
Am {q) Aq{j^ 



Bm{q) 

Am {q) 



B-{q)T{q) 



Ao{q) Bm {q) 



(13.60) 



Rigorously, this equation implies model tracking, as the poles and the zeros 
are considered simultaneously, while the strict pole-placement only takes into 
account the poles. Astrom and Wittenmark (1989) insist on the importance in 
considering the zeros in order to realize a good control system. Moreover, they 
advise not to change the open-loop zeros of the process, but to keep them in 
the model of the tracking dynamics Bm{q)/Am{q)- 

From Eq. (13.60), we deduce that B~{q) must divide Bm{q), thus by setting 



B^{q)=B-{q)B'^{q) 



(13.61) 



we obtain the polynomial T{q) 

T{q) = AM BM = (13.62) 

Landau (1990), using the backward shift operator q~^ ^ proposes a less severe 
condition for Eq. (13.60) by only demanding that the gain between output 
and set point is unitary, and that the output dynamics is that of polynomial 
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Ajn{q ^). In these conditions, it suffices that the polynomial T{q 
to 



T(g-i) = 



Ao{q-^) 
B'{1) 
Ao{q ) 



if: 

if: 



B'{1) = 0 



This approach is, of course, usable with the forward shift operator q. 
The degree conditions are 



is equal 
(13.63) 



deg P > 2deg A — 1 deg Ao > 2deg A — deg A^ — deg B~^ — 1 
deg Am - deg Bm > deg A- deg B 



(13.64) 



Then, it is possible to look at the particular pole-placement cases. In all 
cases, the control law will be given by Eq. (13.18). The reference model Bmj Am 
must be specified by the user and must satisfy the degree condition. 



Pole- Placement without Zero Compensation 



In the absence of zero compensation, it suffices to take B^{tq) = 1 by compar- 
ison with the general case. We obtain the Bezout equation, which gives R{<q) 
and Sigq) 





A{q) S{q) + B{q) R{q) = A^iq) Ao{q) 


(13.65) 


and 


T(n\ Ao{q) Bm{q) 

~ B{q) 


(13.66) 


Pole-Placement with all Zeros Compensated 




When all zeros (necessarily stable) are compensated, it suffices to take B~^{q) = 
B{q)/bo by comparison with the general case, so that B~^{q) is monic. We 
obtain the Bezout equation, which gives R{q) and S'{q) 




A{q) S'{q) + bo R{q) = Am{q) Ao{q) 


(13.67) 


with 


S{q)=S'{q)B+{q) 


(13.68) 


and 


T{q) = Ao{q) Bm{q)/bo 


(13.69) 



Pole-Placement with Stable and Well-Damped Zeros Compensated 

This case is the most general, and thus resumes the equations previously deve- 
loped in the general framework. 

B{tq) is factorized with B^{tq) a monic polynomial containing the stable and 
well-damped zeros. 

B{q)=B^{q)B-{q) (13.70) 

The polynomial P{<q) is equal to 

P{q) = Am{q) B+{q) Ao{q) 



(13.71) 
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where Ao{q) specifies the observer dynamics. 

The simplified Bezout equation 

Mq) S'{q) + B~ {q) R{q) = A^iq) Ao{q) (13.72) 

gives R{q) and S'{q) such that: S{q) = S'{q) B~^{q). 

The polynomial T{q) is equal to 

T{q) = (13.73) 

Example 13.2: Pole-Placement Control of a Chemical Reactor 




The pole-placement control will be ap- 
plied to the chemical reactor described 
in Sect. 19.2. 



The model of the pilot chemical reactor (Sect. 19.2) obtained by identification 
by means of recursive extended least squares is a stable system with transfer 
function 



^ _ B{q) _ 1.203^ + 0.426 

“ A{q) ~ q^ - 1.083 q + 0.104 

The total time delay d + 1 is then equal to 1. This transfer function presents 
two stable poles at z = 0.976 and z = 0.106 and a stable zero at z = —0.354. 
In the present case, the sampling period is Tg = 5 s. 

The observer is chosen so as to ensure faster dynamics than the process 
one. A first-order continuous filter with time constant r = 4s is taken and 
discretized, hence 



Ao{q) = q — 0.286 

a) First, a pole-placement without zero compensation is realized. 

An integrator Hi = q — 1 is added to the polynomial S{q) in order to reject 
step-like disturbances, and a robustness filter H 2 = {q — 0.8)/(l —0.8) is added 
to the polynomial R{q). The degree conditions lead to choose a polynomial 
P{q) of degree 5. The polynomials R{q) and S{q), which will be calculated by 
the Bezout equation, will both have degree 2. 

As a reference model, a transfer function optimal with respect to the ITAE 
criterion (Table 4.1) is chosen 

^ 

^ 5^ + 2AiOos^ + 3.4cJq5^ + 2.7cJq + cJq 




478 



Chapter 13. Digital Control 





Figure 13.7: Pole-placement control without zero compensation of the identi- 
fied linear model of the chemical reactor: output temperature (top), variations 
of the input (bottom). Set point step of amplitude lOK at t = 0 and step 
disturbance of amplitude 5 K at t = dOTg at the output 



with cjo = O.3/T5. It is then discretized with a zero-order holder, thus 

Bm{q) _ 10-3(0.296^3 + 2.848^2^2.511^ + 0.203) 

Am{q) ~ q^ - 3.281^3 ^ 4.153^2 _ 2.399^ + 0.533 

which has the poles 0.818 ± 0.326i and 0.822 ± 0.103b 
The polynomial P{q) is equal to 

P{q) = Am{q) Ao{q) = {q^ - 3.281^3 ^ 4.153^2 _ 2.399^ + 0.533) {q - 0.286) 
The Bezout equation is written as 

A{q) H,{q) S{q) + B{q) R{q) = A^iq) Ao{q) 

{q^ - 1.083 q + 0.104) {q - 1) S{q) + (1.203 q + 0.426) {q - 0.8)/(l - 0.8) R{q) = 
\q^ - 3.281^3 ^ 4.153^2 _ 2.399^ + 0.533) {q - 0.286) 

which gives the controller polynomials R{q) and S{q) (not taking into account 
the integrator and the robustness filter) 

R{q) = 0.0533^2 _ o.0897^ + 0.039 ; S{q) = q^ - 1.805^ + 0.829 

So that B{q) does not divide Bm{q), the previous polynomial Bm{q) is not 
kept, but is replaced by 
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^ B{z)A^{l) 
B{1) 



We deduce the polynomial of the precompensator 



T(z) 



Aq{z) Amjl) 
B{1) 



(2-0.286)0.00586 

1.629 



0.00362 - 0.00103 



The system is subjected to a step set point of amplitude 10 K occurring at 
instant t = 0 and converges towards the set point (Fig. 13.7) without noticeable 
overshoot. Then, after 40 sampling periods, it is subjected to a step disturbance 
of amplitude 5 K acting at the output, which is perfectly rejected due to the 
integrator effect introduced in the controller. The input variations are not too 
violent and are quite acceptable (Fig. 13.7). 

b) Then, a second pole-placement with zero compensation is performed. 





Figure 13.8: Pole-placement control with zero compensation of the identified 
linear model of the chemical reactor: output temperature (top), variations 
of the input (bottom). Set point step of amplitude lOK at t = 0 and step 
disturbance of amplitude 5 K at t = AOTg at the output 

We wish to include an integrator polynomial Hi{q) = q — 1 in the controller 
polynomial S{q) in order to reject step-like disturbances. 

The reference model is calculated from an optimal transfer function with 
regard to the ITAE criterion (Table 4.1) 

( a) — ^ 

^ 5^ + 1, + 2, 15cJq5 + cJq 
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Figure 13.9: Pole-placement control with zero compensation of the identified 
linear model of the chemical reactor: output temperature. Set point step of 
amplitude 10 K at t = 0. Random disturbance of standard deviation 0.5 K at 
the output 



still keeping cjq = O. 3/T5 in order to really compare the results with those of 
the previous case. Then, we discretize with a zero-order holder, hence 

Bm{q) _ 10-3(3.929^2^ 13.720^+ 3.022) 

Am{q) ~ q^ - 2.432^2 ^ 2.044^ - 0.592 

which has the poles 0.811 ± 0.269i and 0.809. 

The polynomial B{q) is factorized into 

B{q)=B+{q)B-{q) 

B+(^) = ^ + 0.354 ; S“(^) = 1.203 

where B~^{q) is monic and contains the stable zero. By keeping the same 
polynomial observer as in part a), the polynomial P{q) is equal to 

P{q) = Ara {q) B+ {q) Ao (q) 

= (q^ - 2.432^2 2.044^ - 0.592) (q + 0.354) (q - 0.286) 

The simplihed Bezout equation is written as 

A(q) Hi(q) S'(q) + B~ (q) H 2 {q) R{q) = Ara{q) AM 

{q^ - 1.083^ + 0.104) {q - 1) S'{q) + 1.203 {q - 0.8)/(l - 0.8) R{q) = 

+ - 2.432^2 2.044^ - 0.592) {q - 0.286) 

giving the controller polynomials (not including the integrator and the robust- 
ness hlter) 



S'{q) =^-0.813^ 

S{q) =B+{q)S'{q) 

= 0.354) (^-0.813) 

= q^ - 0.459^ - 0.288 
R{q) = 0.0295^2 q.000403^ - 0.0176 



The polynomial of the precompensator is equal to 
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Ao { q ) Bm { q ) _ {q ~ 0.286) 10-^(3.929^2 ^ 13.720^ + 3.022) 

“ B-{q) “ L203 

= 10-3(3.266^3 + 10.469^2 _ q.756^ - 0.720) 

The system was subjected to the same step set point of amplitude 10 K 
as in case a). It converges towards the set point a little faster and without 
overshoot (Fig. 13.8). Then, after 40 sampling periods, it is subjected to the 
same disturbance as in a) acting at the output; it is a little better rejected 
than in case a). The input is slightly smoother (Fig. 13.8). When a random 
noise of standard deviation 0.5 K affects the output (Fig. 13.9), the system 
nevertheless maintains the desired behaviour around the set point identical to 
Figure 13.8. 



13.1.5 Digital PID Controller 

Discretized PI Analog Controller viewed as an RST 

The transfer function of an analog PI controller is 



G{s) = K 



1 + 



TjS 



(13.74) 



Using the backward difference approximation as a discretization operator {Tg 
being the sampling period) 



Ts 

q-l 



(13.75) 



we obtain the following equation of the discretized PI controller 



{q - l)u{t) = K 



Ts' 

q-l + — 



{yr{t) -y{t)) 



(13.76) 



This equation can be compared to the general equation of the pole-placement 
(RST) controller (Fig. 13.2) 



S{q)u{t) = -R{q) y{t) + T{q) yr{t) (13.77) 



By identification, we deduce the polynomials R, S, T which characterize a PI 
controller 



S{q) 

R{q) 

T{q) 



= q-l 
= -K \ 



q-l + 



= -m 



h 

Tl 



(13.78) 



We notice that S{q) corresponds to a simple integrator. The coefficients of the 
controller polynomials will be calculated according to the polynomial P, which 
define the closed-loop behaviour by specifying the values of the poles 



A{q) S{q) + B{q) R{q) = P{q) 



(13.79) 
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The polynomial P can be chosen by proceeding to the discretization of one of 
the continuous polynomials optimal with regard to the ITAE criterion (Table 
4.1). 

In the case where the identified process model is limited to polynomials A 
and B of order 1, the calculation of the coefficients becomes very simple. 



Discretized PID Analog Controller seen as an RST 

The transfer function of a real analog PID controller is 



G{s) = K 



1 

1 + + 

T/ S 



Tp S 

1 I m o 
^ ^ N 



(13.80) 



with Tp/N representing a filtering factor of the derivative (derivative time 
constant Tp) and r/ integral time constant. 

We proceed with the same operator as previously; s (derivative action) 
is approximated by {q — l)/Tg and 1/s (integral action) is approximated by 
Ts/{q — 1). We deduce the discrete transfer function of the discretized analog 



PID 



H{q) = K 



Ts 1 N{q-1) 

Ti q-1 ^ TsN - Tp 

td 



(13.81) 



By identifying 

and imposing an integrator in the monic polynomial S{q) = {q — l)S'{q), which 
allows the convergence towards the steady-state, we have 



^{q) = (^ “ 1) (^ + -^i) with: Si = — — (13.83) 

Tp 

It results that 

R{q) =riiq^ Priq^T2 

ro = K{1 + N) 

ri = K{-1 + si + — - 2N] (13.84) 

n 

T2 =K(si(— -l) + iV) 

Tl 

The coefficients of R{q) and S{q) will be calculated in the same way as for the 
PI controller by specifying the polynomial P{q) such that 



A{q) S{q) + B{q) R{q) = P{q) (13.85) 



which defines the closed-loop poles whose transfer function is equal to 



H (a\ = ypPi= B{q)T{q) ^ B{q)T{q) 

y{t) A{q)S{q)+B{q)R{q) P{q) 



(13.86) 
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In order to ensure the asymptotic convergence of the output towards the set 
point, let Hci{l) = 1, the tracking polynomial T{q) is chosen as 



T(z) 



C(l) 

B{1) 



R{1) 



(13.87) 



noting that 5"(1) =0 (because of the integrator term). The closed-loop transfer 
function thus becomes 






C(l) B{z) 
B{1) P{z) 



(13.88) 



13.2 Discrete Internal Model Control 



Discrete internal model control (Garcia and Morari, 1982), (Morari and Zahriou, 
1989) is very similar with regard to its block diagram (Fig. 13.10) and the prin- 
ciple of continuous internal model control explained in Sect. 13.2. It presents 
very good robustness qualities, that is, faced with variations in structure or 
parameters of the process model. It is also possible to use it by taking into 
account constraints on the inputs and the states (Rotea and Marchetti, 1987). 
However, internal model control is, in principle, reserved for open-loop stable 
processes. In the case of open-loop unstable processes, it is necessary to hrst 
proceed to a classical stabilizing feedback. 

The signals represented in the block diagram in Figure 13.10 are sampled 
signals. The process assumed to be open-loop stable, is represented by a dis- 
crete transfer function G{z) and subjected to a disturbance d{t). In practice, 
it would be the actual process. The process model is the discrete transfer func- 
tion G{z) while the internal model controller has the transfer function Gc{z). 
A robustness hlter with transfer function F{z) is, in general, included in the 
feedback loop. 

When the model is perfect, the feedback signal d{t) in the loop is only 
the disturbance d{t). Thus, internal model control appears as an open-loop 
system, implying the necessity of the stability of the process model G and the 
controller Gc- When the model and the process differ, the feedback signal d(t) 
contains information concerning the model error, and it is possible to obtain 
some robustness by acting on d(t) through the robustness filter F{z). 

According to the block diagram, including the filter F{q), the following 
relations are obtained 



u{t) = 

yit) = 



1 + Gc{q) {G{q) — G{q)) 
G{q) Gc{q) 



{G,{q)yr{t)-F{q) G,{q)d{t)) 



■yr{t) + 



1-F{q) G{q) G,{q) 



■ d{t) 



1 + F{q) Gc{q) (G{q) — G{q)) 1 + F{q) Gc{q) (G{q) — G{q)) 

(13.89) 

Suppose that F{z) = 1. From this equation, it results that a perfect con- 
trol minimizing the sum of the error squares, either in regulation or set point 
tracking, would correspond to the following controller 



Gc{z) 



1 

G{z) 



(13.90) 
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Figure 13.10: Discrete internal model control 



Moreover, this controller would give zero deviation with respect to the set point. 
The same controller would also ensure perfect disturbance rejection. However, 
in general, such a controller is neither stable nor physically realizable. 

The process is modelled by a transfer function expressed with respect to 
the poles and zeros 



G{z) 



U7=ii^-Pi) 



(13.91) 



where Zi represents a zero and pi a pole of G. 
The model G{z) is factorized as 



G{z) — G^{z) G-{z) 



(13.92) 



where G-^{z) contains all the time delays (whose inversion would correspond to 
a prediction) and the zeros outside the unit circle (whose inversion would lead 
to an unstable controller), and thus includes all characteristics of nonminimum- 
phase type. 

This factorization is not unique, but Garcia and Morari (1982) recommend 
choosing 

where d is the effective delay of the process and z* represent the n — 1 zeros of 
G{z), while z* are their images inside the unit circle 



f 2:* if: |zj| < 1 

\ 1/zj if: |zj| > 1 



(13.94) 



Equation (13.93) thus retains in G-|_(z) only the unstable zeros and the time 
delays. The transfer function of the controller is then chosen as 



(13.95) 
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In fact, the following rules (Zafiriou and Morari, 1986) will be used for the 
controller synthesis 

— As G{z) is stable, the n poles Pi of G{z) become zeros of Gc{z). 

— The ns stable and well-damped zeros zf of G(z) become poles of Gc{z)^ 
while the nj unstable zeros zj with a positive real part of G{z) are replaced 
by their inverse 1/^f, which will thus be stable poles of Gc{z). 

— The unstable zeros z^ with a negative real part or badly damped zeros 
of G{z) are not taken into account in Gc{z). 

— Poles at the origin are introduced in order to avoid anticipation. 

— The gain of controller Gc{z) is such that 

Gc(l) = ^ (13.96) 

By using these different rules, we obtain the following transfer function for 
the controller 



Gc{z) — 



(-ir 



-n+n^+n'^ 



U7=li^-Pi) 



&onr=i(i-^?)nr=i4 



(13.97) 

where bo is the coefficient of the monomial of the highest degree of the numer- 
ator of G{z) assuming that the denominator is monic. 

A robustness filter F{z) is included in the feedback loop to take into account 
the imperfections of the model compared to the process through the estimated 
disturbance d{t). In these conditions, Eq. (13.89) becomes 



y{t) = 



G{q) 



G.{q) + F{q) {G{q)-G{q)) 



\ ^ \ , 7-1/ \ ^ \ f \\ ^ ' 



GSq)FF{q) (G{q)-G{q)) 

(13.98) 

If the model is perfect: G{pi) = G{pi)^ Eq. (13.98) becomes 



y{t) = G+(?) Vr{t) + (1 - F{q) G+(^))(i(t) 



(13.99) 



Assuming that the gain of the robustness filter is equal to 1: F{1) = 1, the 
asymptotic convergence of the output y towards the set point yr is guaranteed 
if G+(l) = 1. 

When the model is different from the process, the constant disturbances are 
asymptotically rejected if F{1)G~^{1) = 1. Assuming that F{1) = 1, this is 
equivalent to Gc{l)~^ = G{1). Moreover, the output converges asymptotically 
towards the set point if F{1) = 1 and G~^{1) = 1. 

The characteristic equation deduced from Eq. (13.98) allows us to discuss 
the stability of the system 



G-{z) + F{z) (G(z) - G(z)) = 0 (13.100) 

The filter F{z) will be chosen such that this equation has its roots conveniently 
located in the unit circle. Thus, it is possible to use an exponential robustness 
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yr{t) 






Vrefit) 



Figure 13.11: Reference model 



filter which acts on the regulation dynamics 

F(z) = , 0 < a < 1 (13.101) 

z — a 

such that the increase of the filter time constant, thus the increase of parameter 
ce, increases robustness. 

In order to further improve the robustness, it is also recommended to filter 
the set point by a reference model of unit gain, which acts on the 

tracking dynamics (Fig. 13.11). 

Thus, it is possible to separately set the regulation and tracking objectives. 
The Smith predictor is a similar form of internal model control. Internal 
model control can also be considered from the point of view of linear quadratic 
control (Garcia and Morari, 1982). When the use of an exact, although stable, 
inverse in the controller brings excessive control variations, it may be interesting 
to replace the exact inverse by an approached inverse. 

Example 13.3: Discrete Internal Model Control of a Chemical Reac- 
tor 




The reactor discussed in Sect. 19.2 is 
used as the real process in this discrete 
internal model control, and constitutes 
a nonlinear simulation taking into ac- 
count heat transfer and the chemical re- 
action when the latter occurs. 



This reactor was previously identified by means of a very simple ARMAX model 
by the RELS method, with a sampling period Tg = 5 s, which gave 



G{z) 



1.916 
z -0.975 



The robustness filter corresponding to Eq. (13.101) has its constant a = 0.9. 
The controller transfer function is equal to 

z- 0.975 0.522Z- 0.508 

; — 

and corresponds to Eq. (13.97) by use of Eq. (13.93). 

The reactor temperature set point is fixed at 300 K from 0 to 750 s, then 
320 K from 750 s until the end. The reactor is not initially in steady state 
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so that the reactor temperature must iu a hrst step reach the set poiut of 300 
K. The coutrol variable, correspoudiug to the opeuiug perceutage of a valve 
distributiug the heat-couductiug fluid betweeu a hot heat exchauger aud a 
cold heat exchauger, is bouuded betweeu 0 aud 1. The output is affected by 
white uoise of amplitude 0.5. The chemical reactiou starts at 1500 s, aud thus 
coustitutes a disturbauce for the system. 





Figure 13.12: luterual model coutrol of the chemical reactor with Ai7 = 
—7 X 10^ J/mol. Top: output temperature aud set poiut. Bottom: coutrol 
variable 

The hrst simulatiou has beeu realized with a heat of reactiou equal to Ai7 = 
—7 X 10^ J/mol. Figure 13.12 shows that the temperature very correctly reaches 
the hrst set poiut at 300 K, theu the secoud set poiut at 320 K by preseutiug 
very little overshoot. The disturbauce iuhueuce related to the chemical reactiou 
is uot visible. The coutrol variable varies effectively betweeu its lower aud 
upper coustraiuts. The coutrol saturates wheu the physical demaud of the 
heat-couductiug system is at its highest, hrst iu the cooliug stage, theu iu the 
heatiug stage. Iu order to avoid too large variatious of the coutrol, the coustaut 
a of the robustuess hlter was voluutarily importaut, at the risk of losiug a little 
performauce. Iu these couditious, the coutrol varies relatively slowly. The 
peaks at the start aud the set poiut chauge could be avoided by iutroduciug a 
refereuce trajectory. 

Iu order to display hrstly the difficulty of coutrolliug exothermic reactors aud 
secoudly the robustuess of iuterual model coutrol, the heat of reactiou was 
iucreased by a factor of 14 with respect to the coucerued heat of reactiou. 
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Figure 13.13: Internal model control of the chemical reactor at the limit of 
thermal runaway of the reactor in the case where the heat of reaction is large 
{AH = —10^ J/mol). Top: output temperature. Bottom: control variable 

which is considerable. Figure 13.13 was thus obtained for AH = —10^ J/mol. 
In the first stage, after the beginning of the reaction (at 1500 s), a temperature 
deviation of the reactor with the set point occurs, then the system succeeds in 
controlling the temperature of the reactor contents. If the heat of reaction were 
increased to AH = —11 x 10^ J/mol, an increasing deviation would appear 
because of the valve saturation and the temperature would be controlled no 
more. The physical capacity of cooling of the heat-conducting fluid would 
become insufficient and no control system could remedy that problem. 



13.3 Generalities in Adaptive Control 

(Goodwin and Sin, 1984) distinguish three types of control by increasing order 
of difficulty: 

— Deterministic control, where disturbances are absent and the process model 
is known. 

— Stochastic control, where disturbances are present and the process model 
is known. 
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— Adaptive control, where disturbances are present and the process model is 
completely specihed. 

In adaptive hltering, adaptive prediction or adaptive control, a problem of 
parametric estimation occurs. If the parameters are constant, the gain of the 
estimation algorithms converges towards zero, whereas if the parameters are 
time- varying, the algorithm must be able to follow the variations. 

In general, two levels of adaptive control are distinguished: 

• Indirect adaptive control: hrst, an estimation of the model parameters is 
realized, then the controller is calculated (Fig. 13.14). The control law is 
thus obtained in two separate stages. 




Figure 13.14: Indirect adaptive control 

• Direct adaptive control: the right parameters of the controller are directly 
estimated (Fig. 13.15) by implicit integration of the model parameters. 




Figure 13.15: Direct adaptive control 

The case where the controller parameters are already known in advance and 
preprogrammed (as in gain scheduling) is, in reality, regulation, not adaptive 
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control. 

The problem of parametric identification was treated in Chap. 10. The 
adaptive control law is obtained by using the certainty equivalence principle, 
i.e. by replacing the model of the process by its admissible estimation when 
the control law is to be calculated. 

Many control methods are susceptible to being used in indirect adaptive 
control: internal model control, pole-placement, pole and zero placement or 
model reference control. 

Generalized predictive control (Sect. 15.4), with a reference model (Sect. 
15.1), with a partial state reference model (Sect. 15.5) are direct adaptive 
control methods. 

Adaptive control is very attractive because it is possible to hope to take into 
account slow variations of the process or disturbances affecting it. It imposes 
on us the taking of many precautions: convergence of the parameters of the 
process model, robustness of the on-line identification algorithm and stability 
of the control at each instant. 

Many books are devoted to adaptive control and it is not possible to cite 
all of them: Astrom and Wittenmark (1989); Bastin and Dochain (1990); Bit- 
mead et al. (1990); Dugard and Landau (1990); Goodwin and Sin (1984); Lan- 
dau (1988); Landau and Dugard (1986); Sastry and Bodson (1989); Watanabe 
(1992). 

Some review papers are: Astrom (1983); Astrom and Wittenmark (1973); 
Astrom et al. (1977); Garcia et al. (1989); Landau (1974); Seborg et al. (1986). 



References 

K. J. Astrom. Robustness of a design method based on assignment of poles and 
zeros. IEEE Trans. Automat. Control^ AG-25: 588-591, 1980. 

K.J. Astrom. Theory and applications of adaptive control - A survey. Auto- 
matical 19(5):471-486, 1983. 

K.J. Astrom and B. Wittenmark. On self tuning regulators. Automatical pages 
185-199, 1973. 

K.J. Astrom and B. Wittenmark. Adaptive Control. Addison- Wesley, New 
York, 1989. 

K.J. Astrom, U. Borisson, L. Ljung, and B. Wittenmark. Theory and applica- 
tions of self-tuning regulators. Automatical 13:457-476, 1977. 

G. Bastin and D. Dochain. On-Line Estimation and Adaptive Control of Biore- 
actors. Elsevier, Amsterdam, 1990. 

R. R. Bitmead, M. Gevers, and V. Wertz. Adaptive Optimal Control, The 
Thinking Man^s CPC. Prentice Hall, New York, 1990. 




Process Control 



491 



P. De Larminat. Automatique, Commande des Systemes Lineaires. Hermes, 
Paris, 1993. 

L. Dugard and LD. Landau, editors. Commande Adaptative des Systemes. 
Theorie, Methodologie, Applieations, ENSIEG, BP 46, 38402 St-Martin 
d’Heres (Erance), 1990. Laboratoire d’ Automatique de Grenoble. 

G.E. Garcia and M. Morari. Internal model control. 1. A unifying review and 
some new results. Ind. Eng. Chem. Proeess Des. Dev.., 21:308-323, 1982. 

G.E. Garcia, D.M. Prett, and M. Morari. Model predictive control: Theory 
and practice - a survey. Automatiea^ 25(3):335-348, 1989. 

G.G. Goodwin and K.S. Sin. Adaptive Filtering, Predietion and Control. Pren- 
tice Hall, Englewood Gliffs, 1984. 

R. Isermann. Digital Control Systems, volume II. Stochastic Gontrol, Multiva- 
riable Gontrol, Adaptive Gontrol, Applications. Springer- Verlag, 2nd edition, 
1991. 

LD. Landau. A survey of model reference adaptive techniques - Theory and 
applications. Automatiea, 10:353-379, 1974. 

LD. Landau. Identifieation et Commande des Systemes. Hermes, Paris, 1988. 

LD. Landau. System Identifieation and Control Design. Prentice Hall, Engle- 
wood Gliffs, 1990. 

LD. Landau and L. Dugard. Commande Adaptative. Masson, 1986. 

LD. Landau, R. Lozano, and M. M’Saad, editors. Adaptive Control. Springer- 
Verlag, London, 1997. 

R. H. Middleton and G.G. Goodwin. Digital Control and Estimation. Prentice 
Hall, Englewood Gliffs, 1990. 

M. Morari and E. Zafiriou. Robust Proeess Control. Prentice Hall, 1989. 

K.S. Narendra and A.M. Annaswamy. Stable Adaptive Systems. Prentice Hall, 
Englewood Gliffs, 1989. 

M.A. Rotea and J.L. Marchetti. Internal model control using the linear qua- 
dratic regulator theory. Ind. Eng. Chem. Res., 26:577-581, 1987. 

S. Sastry and M. Bodson. Adaptive Control - Stability, Convergenee, and Ro- 
bustness. Prentice Hall, New Jersey, 1989. 

D.E. Seborg, T.E. Edgar, and S.L. Shah. Adaptive control strategies for process 
control: a survey. AIChE J., 32(6):881-913, 1986. 

M. Vidyasagar. Control Systems Synthesis: A Faetorization Approaeh. MIT 
Press, Gambridge, Massachusetts, 1985. 




492 



References 



K. Watanabe. Adaptive Estimation and Control. Prentice Hall, London, 1992. 

E. Zafiriou and M. Morari. Design of robust digital controllers and sampling- 
time selection for SISO systems. Int. J. Cont., 44(3):711-735, 1986. 




Chapter 14 



Optimal Control 



14.1 Introduction 

Frequently, the engineer in charge of a process is faced with optimization pro- 
blems. In fact, this may cover relatively different ideas. 

For example, consider a chemical reaction studied in a laboratory; the 
chemist seeks the best kinetic parameters: stoichiometries, reaction orders, rate 
constants, activation energies, i.e. the parameters that will help to represent 
the reacting system optimally. Then, the chemist realizes a static optimiza- 
tion, which most often consists of minimizing the distance between a set of 
experimental data and the corresponding prediction given by the model. 

Now, consider the engineer in charge of a process, thus responsible for the 
real plant production. The laboratory chemist will have given to the engineer 
the information which seemed necessary concerning the reaction and has pro- 
posed a recipe for the experimental operation guide. The engineer knows that 
the reactor, being continuous, batch or fed-batch, and more or less similar to 
a perfectly stirred reactor or a plug-ffow reactor, will behave, in reality, rela- 
tively differently to the lab reactor and will be closer to the pilot reactor, if 
the latter was previously operated. For example, he or she knows, that the 
reactive feed ffow rate profile for a fed-batch reactor and the temperature or 
pressure profile to be followed for a batch reactor, will have an infiuence on the 
yield, the selectivity or the product quality. The engineer wishing to optimize 
production must then seek a time profile and realize a dynamic optimization 
with respect to the variables which he or she can manipulate, while respecting 
the constraints of the system such as the bounds on temperature and tempe- 
rature rise rate, the constraints related to the possible runaway of the reactor. 
Similarly, an engineer realizing a reaction in a tubular reactor can seek the 
optimal temperature profile along the reactor. In the latter case, it is a spatial 
optimization very close to the dynamic optimization where the time is replaced 
by the abscissa along the reactor. The profile thus determined is calculated 
in open loop and will be applied as the set point in closed loop, which may 
lead to deviations between the effective result and the desired result. The di- 
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rect closed-loop calculation of the profile in the nonlinear case is not studied 
here; on the contrary, the linear case is treated in linear quadratic control and 
Gaussian linear quadratic control. 

In a continuous process, problems of dynamic optimization can also be con- 
sidered concerning the process changes from the nominal regime. For example, 
the quality of the raw petroleum feeding the refineries changes very often. The 
economic optimization realized off-line imposes set point variations on the dis- 
tillation columns. An objective for the engineer can be to find the optimal 
profile to be followed during the change from one set of set points to another 
set. 

In all cases, the engineer needs a dynamic model that is sufficiently repre- 
sentative of the behaviour of the process, and which is also of a reasonable 
complexity with respect to the difficulty of the mathematician and numerical 
task of solving. 

Among the criteria that the engineer wishes to optimize, of course, can be 
found the reaction yield or the selectivity, and also the end-time taken to reach 
a given yield, or any technical-economic criterion which simultaneously takes 
into account technical objectives, production or investment costs. 

Optimal control is the formulation of the dynamic optimization methods in 
the framework of a control problem. 

14.2 Problem Statement 

The optimal control problem is first set in continuous time. The studied system 
is assumed to be nonlinear. Applications which will follow in control will be 
developed only for linear systems. 

The fixed aim in this problem is the determination of the control u{t) mini- 
mizing a criterion J {u) while verifying initial and final conditions and respect- 
ing constraints. The optimal control thus denoted by makes the state x(t) 
follow a trajectory which must belong to the set of admissible trajectories. 

The formulation of the optimal control problem is the following: 

Consider a system described in state space by the set of differential equations 

x{t) = f{x{t),u{t)) ; to<t<tf (14-1) 

with X being a state vector of dimension n and u a control vector of dimension 
m. The system is subjected to initial and final conditions, called terminal (or 
at the boundaries) 

k{x{to),to) = 0 ; l{x{tf),tf) = 0 (14.2) 

Moreover, the system can be subjected to instantaneous inequality constraints 

p{x{t)^u{t)^t) < 0 Vt (14.3) 

or integral constraints (depending only on to and tf) 
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The question is to find the set of the admissible controls u{t) which minimize 
a technical or economic performance criterion J{u) 

ftf 

J{u) = G{x{to),to,x{tf),tf) ^ / F{x{t),u{t),t)dt (14.5) 

G is called the algebraic part of the criterion. F is a functional^. Frequently, 
the initial instant is taken as to = 0- 

The performance index J depends on the type of problem. Some examples 

are: 

• For a minimum-time problem (minimize tf) 

rtf 

J = / dt = tf (14.6) 

Jo 

• Minimization of the state error variance 

J = [ (^(t) — x^^^{t))‘^dt (14.7) 

Jo 

where x^^^ is the reference state. 

• Combination of performance indices 

J = [ dt F li f {x{t) — x^^^{t))‘^dt (14.8) 

Jo Jo 

where /i is a weighting factor. 

• Weighted sum of the state error variance and the control error variance 

J = [ jj .1 [x{t) — x^^^{t)]‘^ dt tt 2 [u{t) — dt (14.9) 

Jo 

• Use of a nonlinear functional 

rtf 

J= / F{x(t),u(t),t) dt (14.10) 

Jo 

• Combination with a functional 

rtf 

J = G{x{G)^u{G)^x{tf)^u{tf))F / F{x(t),u(t),t) dt (14.11) 

Jo 

Notice that an inequality constraint such as (14.3) can be replaced (Boudarel 
et ah, 1969) by an equality constraint, by adding an auxiliary function according 
to Valentine’s method, as 

p{x{t),u{t),t) F y‘^{t) = 0 Ft (14.12) 

functional is a function of functions: the function F(a?(t), t) depends on functions 
x{t) and u{t). 
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Similarly, the inequality (14.4) becomes 

ftf 

/ [q{x(t),u(t),t) z(t)‘^]dt = 0 (14.13) 

Jto 

In fact, by introducing the variable w{t) such that 

'^{t) = [ [Q{x{r),u{r),T) ^ z{r)‘^]dT (14.14) 

Jto 

the integral constraint is transformed into an instantaneous constraint 

w{t) - [q{x{t), u{t), t) + z{t)‘^] = 0 (14.15) 

where w must verify the terminal conditions: w{to) = 0, w{tf) = 0. 

In this very general form, this problem makes use of the equality constraints 
corresponding to the state differential equations, the terminal equality con- 
straints, possibly instantaneous or integral inequality constraints, and m inde- 
pendent functions, which are the controls u{t). The term G{x{to), to, ^(^/)? ^/) 
represents a contribution of the terminal conditions to the criterion whereas 
the integral term of Eq. (14.5) represents a time-accumulation contribution. 

Several methods allow us to solve this type of problem: variational meth- 
ods (Kirk, 1970), Pontryagin maximum principle (Pontryaguine et ah, 1974), 
Bellman dynamic programming (Bellman, 1957). The books cited here (Borne 
et ah, 1990; Boudarel et ah, 1969; Bryson, 1999; Bryson and Ho, 1975; Feld- 
baum, 1973; Pun, 1972; Ray and Szekely, 1973) propose compared approaches. 

Variational methods are first presented in a mathematical form which can 
be qualified as dynamic optimization, in order to distinguish the tools indepen- 
dently from the control problem. Then, the control problem is treated by a 
specification of some variables. 

14.3 Variational Method in the Mathematical 
Framework 

In the most general mathematical form, in the classical variational method, the 
performance index to be minimized with respect to the vector of functions / 
(of dimension n) of the variable x is the following 

J{f) = G{xo, f{xo),xi, f{xi)) + f F{x, f{x), f{x))dx (14.16) 

Jxo 

where / and / are considered as independent variables. The vector of functions 
/ must satisfy the following equations 

(pi{x, f{x), f{x)) = 0 i = l,...,n^<n 



kj{xo, f{xo)) =0 j = l,...,no 



(14.17) 

(14.18) 
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f{xi)) =0 j = no + 1, . . . , no + ni < 2n + 2 (14.19) 

In this form, the problem is called a Bolza problem. If the functional F of Eq. 
(14.16) is zero, this is a Mayer problem. If G is zero, it is a Lagrange problem. 
General comments: 

• Notice that in the performance index J, the term G depends only on the 
initial (or lower) limit and on the hnal (or upper) limit xi, while the 
integral term depends on the whole history between the initial and hnal 
limits. 

• Equation (14.17) is a system of differential equations. 

• Equation (14.18) represents the constraints at the initial limit. 

• Equation (14.19) represents the constraints at the final limit. 

• If we define an additional variable 

fn+i{x) = G{xo, f{xo),x, f{x)) + f F{^, f{^), f{^))d^ (14.20) 

J Xo 

equation equivalent to 

fn+i{x) = G'^f + Gx + F{x, f{x), f{x)) with: /„+i(xo) = 0 (14.21) 

and the performance index J equal to 

«/ = /n+i(a;i) = G(xo, /(xo),xi, /(xi)) + f F{x, f{x), f{x))dx (14.22) 

Jxo 

the problem is analogous to the Pontryagin method. 



14.3.1 Variation of the Criterion 

Assuming that /* is the optimal solution belonging to the domain of admissible 
trajectories, we study the infiuence of the variations^ of / in the neighbourhood 
of /* on the criterion J. 

^Several mathematical relations are useful: 

a) We denote by i/z the partial derivative dyjdz^ where z is a scalar. If y is scalar and z a 
vector, the notation yz is the gradient vector of the partial derivatives dyjdzi. If y and z 
are vectors, the notation y^, represents the Jacobian matrix of the current element dyijdzj. 

b) The derivative with respect to f of the integral with fixed boundaries 

^ = !xo 



is equal to 








±F. 

dx f 



dx 



c) According to the Euler-Lagrange lemma (Cartan, 1967), if C{x) is a continuous function 
(vector) on [a, b] verifying 



{x)v{x)dx = 0 

for all function (vector) v{x) which is continuous and becomes zero at the boundaries, then 
C(x) is zero everywhere on [a, 5]. 
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In the most general case, supposing that the boundaries xq and x\ are not 
fixed, noting that /o = /(xq) and likewise for /i, and Fq = F(xo, f{xo)j f{xo)) 
and similarly for Fi, the variation of criterion J of Eq. (14.16) related to a 
variation of these boundaries is equal to 



5J = 



+ 



OfV fdF\ 

w 



Sf 



dx + [Fi 5x\ — Fq (5xo] 



dG\ , 

W I OXo + 

OXo J 



dGV 

WJ 



Sfo 



+ 



f dG 

I 



dCX 

WJ 



1 ^/l 



(14.23) 



The second part of the integral term can be expressed as 




(14.24) 



On the other hand, the following relations expressing the variation of the ter- 
minal functions as the sum of two contributions are used 



= ^f(xo) + f{xo)H^o) and Sf^ = Sf{xi) + f{xi)S{xi) (14.25) 



Using an integration by parts of the second term of the integral, the criterion 
variation equation can be transformed into 

iT 



SJ = 



+ 



xo 

dG 

dxi 



^ A 

[df dx\dfJ\ 

T ^ 

+ I i"- ( ^ I / 






Sfdx 
5x\ F 
5xo — 



dG 

dfi U/yj 



iT 



OJ 



dG /dF\ 
dfo ^ \df ) Q 



Sfi 

iT 



Sfo 



dG / fdPy . 

(14.26) 

Moreover, the variations at the boundaries are dependent because of the con- 
straints (14.18) and (14.19), giving the relations 



dk\ f dk\ 

dl\ ^ , f dl' 






(14.27) 



A necessary, but not sufficient, condition of the minimum of the criterion is 

5J = Q , W 5f, 5fQ, 5f^, 5xo, 5xi (14.28) 
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14.3.2 Variational Problem without Constraints, Fixed 
Boundaries 

Consider the simple case where the criterion to be optimized is simply in the 
form 

l>Xi 

J= / F{x^ f{x)^ f{x))dx (14.29) 

Jxo 

and the boundaries xq and xi are hxed. We seek the admissible optimal tra- 
jectory /*, minimizing the criterion J with respect to / 



/ 



min J 

/ 



(14.30) 



From Eq. (14.26), using the Euler-Lagrange lemma, we deduce the necessary 
Euler conditions so that J has a local extremum in /* (necessary condition of 
stationarity), i.e. the following vector is zero 




=0 

dx df J ^ 



The system (14.31) can be written again in the following form 



(14.31) 



= 0 ( 14 . 32 ) 

df dx df df 

and thus constitutes n second-order differential equations. The 2n degrees of 
freedom are hlled by the n initial and n hnal conditions. 



14.3.3 Variational Problem with Constraints, General Case 

Consider the simple case where the criterion to be optimized is 

J = G{xo, f{xo),xi, f{xi)) + f F{x, f{x), f{x))dx (14.33) 

Jxo 

and the boundaries xq and xi are not hxed. We seek the admissible optimal 
trajectory /*, minimizing the criterion J with respect to / 



/* = arg j^rmn J J (14.34) 

When the problem includes equality constraints such as Eq. (14.17) 

<p{x, f{x), f{x)) =0 Vx G [xo, Xi] (14.35) 

or inequality constraints transformed into equality constraints such as Eq. 
(14.12) 

p(x, /(x),/(x)) -f y^(x) =0 Vxg[xo,xi] (14.36) 
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and such as Eq. (14.13) 



[q{x, f{x), f{x)) + z{x)‘^]dx = 0 



(14.37) 



we apply the Euler conditions (as well as all the terminal conditions, disconti- 
nuity conditions and conditions relative to the second variations) to the aug- 
mented function 



^“(^, /, /, y, 'w) = F{x, /, /) 

+A^(x) cj) + fi^{x) [p + y‘^] + u^{x) [w - q - 

(14.38) 

obtained by introducing for each constraint a Lagrange or Kuhn-Tucker para- 
meter. In this case, the variables concerned by the Euler conditions are: /, 
y, z, w. The Euler conditions applied to the augmented function give the 
following equations 

— With respect to variable / 



/dF^\ 

F ^ 



=0 ^ 

\dx df C 



(14.39) 

qy) = 0 



— With respect to variable y 



— With respect to variable 2 : 



2/r^y = 0 



2u'^z = 0 



(14.40) 

(14.41) 



— With respect to variable w 

i> = 0 u = constant (14.42) 



After applying the Euler conditions, the boundary conditions and the tra- 
jectory discontinuities must be taken into account. 

Terminal Conditions 

The constraints (14.18) acting on the initial boundary xq and (14.19) acting 
on the final boundary xi are called the terminal conditions. They express that 
the terminal values of functions /, x, belong to hypersurfaces or, more simply, 
that relations link the terminal values of functions f{x). 

Transversality conditions 

Taking into account the variation of criterion J described by Eq. (14.26), 
the Euler condition of stationarity and the constraints acting on the boundary 
variations (5 /q, Sxq^ 5x\^ we obtain the transversality conditions, which 
are written as: 
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At the initial boundary xq 



<^/o=0 



At the hnal boundary X] 



<5/0 = 0 



(14.43) 



<5/1=0 



<5/1=0 



(14.44) 

This means that the extremizing trajectory must have in the phase space 
of variables fi the same slope as the trajectories k and I at the initial and hnal 
boundaries xq and xi respectively. 

If the initial boundary xq is hxed, this leads to Sxq = 0 (similarly, Sxi = 0 
if the hnal boundary xi is hxed). If the initial extremity of the trajectory /q 
is hxed, this leads to (5 /q = 0 (similarly, Sf^ = 0 if the hnal extremity /i is 
hxed) . 

Discontinuity Condition 

This condition is also called the Weierstrass-Erdmann condition. The disconti- 
nuous extremizing trajectories are composed of sub-arcs joined by discontinu- 
ities. At these points, the partial derivatives of two sub-arcs must be equal 



(14.45) 



h dfi 



C dfi 



; / = 1, . . . , n (14.46) 



In optimal control, an extremizing trajectory thus can be composed of a hrst 
arc, where the control is saturated at its minimum value //_, followed by an 
optimal arc and then followed by an arc where the control is saturated at its 
maximum value //+. In practice, it is common that the control takes only the 
minimum and maximum values (bang-bang control). 

Partial conclusion and first solution 

After having applied the Euler equations and the terminal and discontinuity 
conditions, we have, in fact, realized a set of necessary conditions for obtaining 
a stationary solution, but not sufficient for the minimum solution that will be 
called a hrst solution. 

Conditions relative to the second variations 

Weierstrass-Erdmann and Legendre-Clebsch conditions are necessary condi- 
tions for the criterion J to be minimum. Weierstrass conditions are relative to 
large variations and Legendre conditions to small variations. 
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Weierstrass- Erdmann Condition 

The necessary condition for the performance index J to be minimum is that 
the Weierstrass function W verifies 

W = F-{r,f,x)-F-{f*,f ,x)-{f-f f(^) >0 (14.47) 

V 9/ /, 



at any point of the extremum arc, for all large variations Afi that are compat- 
ible with constraints (pi. 

Legendre- Clebsch Condition 

This condition is relative to small variations Sf in the neighbourhood of /* 



f)2 TFa 



(14.48) 



If a maximum performance index were sought, the two previous inequalities 
would have an opposite sign. 

By applying the conditions relative to the second variations, the optimal 
solution is completely determined. 



14.3.4 Hamilton-Jacobi Equation 

The problem formulation based on the use of the Hamiltonian allows us to 
express Euler equations in the canonical form of Hamilton equations. The 
Hamilton-Jacobi equation bearing on the criterion optimization leads to us ob- 
taining the optimal trajectory. The discrete Hamilton-Jacobi equation can be 
very well compared to the Bellman optimality principle in dynamic program- 
ming (Sect. 14.5). 

If, on the optimal trajectory, we introduce the variable tp{x) defined as 



dF 

tp(x) = r 

df 



the Euler condition (14.31) becomes 



Provided that the matrix 



d dF 

= df 



d^F 



df 



(14.49) 



(14.50) 



(14.51) 



is nonsingular, the function / can be expressed as a solution of implicit Eq. 
(14.49) as 

hn = P(/(a;), V’(a;), x) (14.52) 

We define the Hamiltonian function or Hamiltonian (similar to the Lagrangian 
function for classical optimization problems) as 



H{f{x),'ip{x),x) = -F{f{x),f{x),x)+'ip'^p{f{x),'ip{x),x) (14.53) 




Process Control 



503 



The partial derivatives of the Hamiltonian are equal to 

Hf = -Ff - f^fFj = -Ff -p^Fp+p’^ip = -Ff 

H-ijj = -f^F^+p + p’^'ip = -p’^Fp+p+p’^'ip =p 

=-F,-Fjf,-Fjf,+iPlp + iP^p, 

= -F^ --ip P- 'ip^P:, F-ip p + 'ip'^p^ = -F^ 



On the other side, using the hrst two expressions of the partial derivatives, the 
derivative of the Hamiltonian gives 

H TT 

— = Hjf + H^->P + H, = -F, (14.55) 

from which we get the Hamilton canonical conditions for optimality 



= H^ 

" ~^f (14.56) 

= -F^ 

The hrst two equations of this system constitute a set of 2n equations equivalent 
to Euler conditions. 

In the neighbourhood of the optimal trajectory /*, realize a variation of 
the trajectory at the boundary x but with always verifying the hnal condition, 
and consider the criterion at the optimal trajectory /* 

J{f\ X) = G{fl xi) + r F{^, no, fiO)d^ (14.57) 

J X 

According to Eq. (14.26), the variation of criterion J7 in the neighbourhood of 
the optimal solution f* is given by 



/ 

->P 

dH 

dx 



SJ=-i^F- Fjrj dx - FjSf = H{f*{x), ^{x), x)Sx - ^ix) Sf{x) 

(14.58) 

hence 

j, = H{r{x),i,{x),x) 

Jf = -^(x) 

The system of Eqs. (14.59) provides the Hamilton-Jacobi equation 

J,=H{r{x),-Jf,x) (14.60) 

which is a hrst-order partial derivative equation which admits as a solution 
the integral ^x) dehned by Eq. (14.57). Along an optimal trajectory, the 
criterion solution of Eq. (14.60) is optimal. The boundary condition of J takes 
into account the end part which is eventually present in criterion (14.57) 

57(r,xi) = G(/t,xi) 



(14.61) 
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The transversality conditions are deduced from Eqs. (14.43) and (14.44): 
at the initial boundary xq, where the constraint is satisfied for all 5xo and 

fdk\ , fdk\ _ , , 

the transversality condition is imposed 

dG 1 r dG 1 ^ 

^ '^0 ^/o = 0 (14.63) 

axo \ I afo 

at the final boundary xi, where the constraint is satisfied for all 5x\ and 

the transversality condition is imposed 

■ QQ 1 r QQ 1 ^ 

— Hi Sxi — h '01 Sf 1=0 (14.65) 

[dxi \ [dfi \ 

If xi is fixed, from Eq. (14.65), the frequently encountered condition results 

BG 

^{xi) = (14.66) 

ofi 

14.4 Optimal Control 

14.4.1 Variational Methods 

As, henceforth, we place ourselves in the framework of optimal control, x plays 
the role of time and the variables fk are divided in two types: state variables 
Xi {I < i < n) and control variables uj (1 < j < m), so that the new problem 
is formulated as: 

Given a criterion 

rtf 

J{u) = G{x{to), u{to), x{tf), u{tf)) + / F{x{t), u{t), t)dt (14.67) 

Jto 

we seek the optimal control trajectory 'it*(t) that minimizes J{u) 

n*(t) = arg |rmn J(n)| (14.68) 

the state and control variables being subjected to the constraints 

(j)i=Xi- fi{x, u,t) = 0 i = 1, . . . , n (14.69) 

kj{x{to), u{to), to) =0 j = 1, . . . , no (14.70) 

lj(x(tf), u(tf),tf) =0 j = no + 1, . . . , no + ni < 2n + 2 (14.71) 
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In the criterion (14.67), the hrst term G is called the algebraic part and the 
second term is called the integral part. Note that the Eqs. (14.69) represent 
the dynamic model of the process. 

When the state can only vary in a given domain, it is preferable to introduce 
new variables. For example, in the case of a one-dimensional state x, such that 
a < X < 5, we can set the variable z such that {x — a){h — x) = . It is also 

possible to do the same for the control u when the latter is bounded between 
two minimum and maximum values. 



14.4.2 Variation of the Criterion 

In the most general case, the criterion variation is equal to 



5J= 



£'( 



Ho 

+ 

+ 






dx) '’‘'■'‘Uuj 



dO\ „ / dO'\ , , , 

dG\ f dG\ ^ f dG , ^ 



dt -\~ ^/) ^o) ^^0 

dG 



(14.72) 

The derivative of variations 5x can be expressed from the state equations as 









(14.73) 

The latter equation can be multiplied by the Lagrange multipliers and inte- 
grated between to and t/, so that 



iy(tf (g) jx- (g) j„|*=o 

By summing this equation and Eq. (14.72), one obtains 



(14.74) 



SJ = 



Hf 

Jto 






6x + 



fdF\ 



\dx) dx 

T F {x ^ ^ i F(^X q^ to) 

dG\ 






dG\ fdG\ ^ 



\du J 

Suo 



du 



Su > dt 



dG\ f dG\ ^ [ dG , ^ 

sj) + W,) + W, I 



J“ 

dto 



^-5xdt 



i,{ty 



dt 



(14.75) 
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The last integral of (14.75) can be integrated into parts, thus 

f -^Sx dt = Sxf — Sxo — f Sx dt (14.76) 

Jto Jto 

If, according to the Hamilton- Jacobi formalism (Sect. 14.4.5), we furthermore 
introduce the Hamiltonian H equal to 

H{x, n, '0, t) = —F{x, n, t) + '0(0^ 0 (14.77) 

the variation of the criterion becomes 



SJ = 



f" 

Jto 






Sx — 



\du) 



6u > dt 



+ 

+ 



,,, 

— 1 ^ H{xo,uo,'il^o,to) 
9G\ ,,, 






Sxo + ( - — ) Suo 



Stf + 






dG 
duo 

+ (^) ■*“' 

(14.78) 



This equation, giving the variation of the criterion, is necessary for understand- 
ing the origin of Hamilton- Jacobi equations (Sect. 14.4.5). 



14.4.3 Euler Conditions 

According to the performance index, the augmented function is defined 

n 

F^{x, i, n, t) = F{x, n, t) + Xi 4>i (14.79) 

i=l 

Notice that the function G does not intervene in this augmented function, as 
G depends only on the terminal conditions. G would only intervene in if 
the terminal conditions were varying. 

The variables are the control vector u(t)^ the state vector x(t) and the 
Euler-Lagrange multipliers A. Euler conditions give 

dF^ d dF^ 

duj dt duj 

dF^ d dF^ 



dxi dt dxi 

dF^ d dF^ 

dXi dt dXi 

The third group of this system of equations corresponds to the constraints 
= 0, thus is a system of differential equations with respect to x. The first 
group is a system of algebraic equations. The second group is a system of 
differential equations with respect to A. 



= 0 j = l,...,m 

= 0 i = l,...,n (14.80) 

= 0 i = 1, . . . , n 
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If inequality constraints of the type (14.3) or (14.4) are present, the Valen- 
tine’s method already discussed in Eqs. (14.12) and (14.13) should be used to 
modify consequently. 

On the other hand, the terminal conditions (14.70) and (14.71), which are 
transversality and discontinuity conditions, as well as the conditions relative to 
the second variations will have to be verihed. 

The transversality equations deduced from Eqs. (14.43) and (14.44) are: 

At the initial time to 



oto V / 0 



6to + 



dG 
dxo 

^ ,dk\ , f dk\ , 

with: \ -^ j ^^0 + J oxo = 0 



iT 



+ A(to) 



6xq = 0 



At the hnal time tf 



dtf V // 



with: 



dl\ 

dtjj, ^ \dx 



5tf + 

dl ^ 

/ 



dG 

dxf 

Sxf = 0 



iT 



+ H'^f) 



Sxf = 0 



(14.81) 



(14.82) 



At a hxed hnal time, from Eq. (14.82), the following condition results 



Htf) 



dG 

dxf 



(14.83) 



Example 14.1: Linear Quadratic Control using Euler Conditions 

A one-dimensional hrst-order linear system is dehned by the following differen- 
tial equation 

X = ax -\-bu (14.84) 

We wish to minimize with respect to control u{t) the quadratic criterion 



J=^ [ {u^ x‘^) dt (14.85) 

2 Jo 

This corresponds to a single-input single-output continuous linear quadratic 
control with a finite horizon tf. 

The augmented function is defined 

i (u^ + x^) + A(± — ax — bu) (14.86) 

The Euler conditions for optimality give 



dF^ 



du 

dF^ 



dx 

dF^ 



dX 



dF^ 

dt du 
d dF^ 

dt dx 
d dF^ 

dt dX 



= u — Xb = 0 
= x — Xa — X = 0 
= X — ax — bu = 0 



with: X(tf) = — — — 

dxf 

with: x(0) = xo 



0 (14.87) 
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that is, two differential equations and an algebraic equation. By replacing A 
with respect to u from the first equation into the second one, and by eliminating 
u by differentiating the third equation, we obtain a unique differential equation 
with respect to the state 

x-{a^ + b^)x = 0 (14.88) 

hence the general solution 

x{t) = a exp(— \/ + h‘^ t) + /3 ex p(\/ + 5^ t) (14.89) 

a and (3 are two constants to be determined from the terminal condition on 
the state x(0) and the transversality condition on the adjoint variable X{tf). 
However, this is a two-boundary problem that is difficult to solve. The control 
u{t) is deduced from the model Eq. (14.84). 

Numerical Issue: 

Indeed, a possibility for obtaining the numerical solution is the following. Given 
an initial value of the control vector, the differential Eq. (14.84) describing the 
model is integrated forwards in time, then the adjoint variable differential Eq. 
(14.87) is integrated backwards in time on the basis of the previous states. A 
new control profile is obtained, for example, by a gradient method, and the 
process is repeated iteratively until there is practically no change in the pro- 
files or the criterion is no more improved. 



14.4.4 Weierstrass Condition and Hamiltonian 
Maximization 



The Weierstrass condition (14.47) relative to second variations is applied to the 
augmented function 

X, n, t) = F{x, n, t) + [x — f(x, n, t)] (14.90) 

in the neighbourhood of the optimum, thus 



F^{x"^,x, n, t) 



F^{x* ,xX ,t) — {x — i*)^ 



dF^ 

dx 



> 0 



(14.91) 



By clarifying these terms and using the constraints 



X = f{x^,u,t) 

= f{x*,u*,t) 

the Weierstrass condition is simplified as 

F(a:^*, n, t) — F(a:^*, n*, t) — (/(^c*, t) — /(a:^*, t^*T)) ^ 0 

f{x\ u*, t) - F{x*,u*, t)] - [a^ f{x*,u, t) - F{x\ m, t) 



(14.92) 



> 0 

(14.93) 
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in which we recognize the expression of the Hamiltonian (setting A = '0) 

H{x\ M, A, t) = -F{x\ M, t) + A^ f{x\ u, t) (14.94) 

We then obtain the fundamental result that the optimal control maximizes the 
Hamiltonian while respecting the constraints 

H{x\u\\t) > H{x\u,X,t) (14.95) 



which will be generalized as Pontryagin’s maximum principle. 

Legendre-Clebsch condition for small variations would have allowed us to 
obtain the stationarity condition at the optimal trajectory, in the absence of 
constraints, as 



fdH 
\ du 



= 0 



(14.96) 



and 




(14.97) 



14.4.5 Hamilton- Jacobi Conditions and Equation 

The Hamiltonian^ is deduced from the criterion (14.67) and from constraints 
(14.69); it is equal to 

H{x{t),u{t),'tp{t),t) = -F{x{t),u{t),t) + f{x{t),u{t),t) (14.98) 



The variation of the criterion has been expressed with respect to the Hamil- 
tonian through Eq. (14.78). We deduce the canonical system of Hamilton 
conditions 



X =H^ 

i, =-Hx 



(14.99) 



which are equivalent to Euler conditions, to which the following equation must 
be added 

Ht = -Ft (14.100) 

The second equation of (14.99) is, in fact, a system of equations called the 
costate equations, and ip is called the costate or the vector of adjoint variables. 
The derivative of the Hamiltonian is equal to 



— = H^x ^ H^u ^ ^ Ht = Ht (14.101) 

^ Other authors use the definition of the Hamiltonian with an opposite sign before the 
functional, i.e. 

H u{t), t) = 

which changes nothing, as long as we remain at the level of first-order conditions. However, 
the sign changes in condition (14.83). See also the footnote in Sect. 14.4.6. 
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If u{t) is an optimal control, one deduces 



H = Ht 



(14.102) 



Generally, the concerned physical system is time-invariant so that time does 
not intervene explicitly in / and also in the functional F, and that Eq. (14.102) 
becomes 

F = 0 (14.103) 

In this case, the Hamiltonian is constant along the optimal trajectory. 

The transversality conditions are deduced from Eqs. (14.63) and (14.65) or 
from Eq. (14.78): 

At initial time to 






with: 



dk 

dt 



StQ + 



StQ + 



dG , , ; 

T '^(^o) 

dxo 



+ t; — ouq = 0 
OUq 



dk 

dx 



Sxo = 0 



(14.104) 



At final time tf 



dl 



Stf + 



with: — 6tf^ 

dt 



dG 

dxf 

dx ) 



iT 



+ 'ip(tf) 



Sxf = 0 



r r 

oxf -\- — — ouf = 0 

dUf 



(14.105) 



It is possible to calculate the variation SJ' associated with the variation St 
and with the trajectory change of Sx, the extremity Xf being fixed, for the 
criterion defined in a similar way to Eq. (14.57) by 



J{x^,t) 



G{x'"{tf),tf) + 




F(a:^*, n*, r)dr 



(14.106) 



The variation of the criterion can be expressed with respect to the Hamiltonian 



SJ{x* ,t) = J{x* + Sx{t), t^ St) - J{x* ,t) 

= -(f- FTx*) St - Fpx{t) 

= -F(x*,u*,t)St (14.107) 

= [H{x* ,u* f{x*,u*,t)] St 
= H{x* , u* , tp, t) St — tp^{t) Sx(t) 

The optimal control corresponds to a maximum of the Hamiltonian. Erequently, 
the control vector is bounded in a domain U defined by Umin and Umax- fa this 
case, the condition that the Hamiltonian is maximum can be expressed in two 
different ways: 

• When a constraint Ui is reached, the function H defined by Eq. (14.98) 
must be a maximum. 
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• When the control is located strictly inside the feasible domain U dehned by 
Umin and Umax, the derivative of function H dehned by Eq. (14.98) with 
respect to u is zero 

-=0 (14.108) 

This equation provides an implicit equation that allows us to express the opti- 
mal control with respect only to variables n* = u"" hence the 

new expression of the criterion 



= Hix"" ^ u^ix^ '0, t), '0, t)5t — 5x(t) 

= + sJx 



(14.109) 



thus by identihcation 



Jt = H{x* 
Jx =-^{t) 



(14.110) 



This equation shows that the optimal value of the Hamiltonian is equal to 
the derivative of criterion (14.106) with respect to time. The Hamilton- Jacobi 
equation results 



Jt - H{x\u%x, -Jx, t), -Jx, t) = 0 (14.111) 

with boundary condition 



J{x),tf)=G{x*{tf),tf) (14.112) 

The Hamilton- Jacobi equation is a hrst-order partial derivative equation with 
respect to the sought function J . Its solving is, in general, analytically impos- 
sible for a nonlinear system. In the case of a linear system such as Eq. (14.214), 
its solving is possible and leads to a Riccati differential equation (14.231). Thus, 
it is possible to calculate the optimal control law by state feedback. Recall that 
the Hamilton- Jacobi Eq. (14.111) in discrete form corresponds to the Bellman 
optimality principle in dynamic programming (Sect. 14.5). 

14.4.6 Maximum Principle 

Now, examine brieffy the Maximum Principle^ (Pontryaguine et ah, 1974) 
about process optimal control. Pontryagin emphasizes several points: 

• An important difference with respect to variational methods is that it is not 
necessary to consider two close controls in the admissible control domain. 

many articles, authors refer to the Minimum Principle, which simply results from 
the definition of the Hamiltonian H with an opposite sign of the functional. Comparing to 
definition (14.98), they define their Hamiltonian as 

ix(t), '0(t), t) = F(a?(t), ix(t), t) + f{x{t)^u{t)^t) (14.113) 

With that definition, the optimal control u* minimizes the Hamiltonian. 
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• The control variables Ui are physical, thus they are constrained, e.g. \ui\ < 
^max, and we consider that they belong to a domain U. The admissible 
controls are piecewise continuous, that is, they are continuous nearly every- 
where, except at some instants where they can undergo first-order disconti- 
nuities (jump from one value to another). 

• Very frequently, the optimal control is composed by piecewise continuous 
functions: the control jumps from one summit of the polyhedron defined 
by U to another. These cases of control occupying only extreme positions 
cannot be solved by classical methods. 

The process is described by a system of differential equations 



x\t) = f{x{t),u{t)) i = l,...,n 



(14.114) 



We seek an admissible control u that transfers the system from point Xq in the 
phase space to point x / and minimizes the criterion 



rtf 

J = G{xo,to,Xf,tf) ^ / F{x{t),u{t)) dt 

Jto 



(14.115) 



To the n coordinates x'^ in the phase space, we add the coordinate defined^ 
by 

x^ = G{xo^to^x{t)^t) F I F{x{t),u{t)) dr (14.116) 

Jto 

so that if X = Xf then x^{tf) = J. Its derivative is equal to 
dx^ 



dt 



— ^xf F Gt -\- F{x{t)^ '^( 0 ) 



(14.117) 



If time occurs explicitly in the terminal conditions (14.2), or if it is not first 
fixed, we add the coordinate x^~^^ to the state (Boudarel et ah, 1969), such 
that 

j. 

(14.118) 



^n+l _ ^ 
^n+1 _ 2 



The complete system of differential equations would then have dimension n + 2. 
In the following, in order not to make the notations cumbersome, we will only 
consider stationary problems of dimension n + 1 in the form 



F = i = 0, ...,n (14.119) 

by deducing from Eq. (14.116) by derivation (extended notation /). 

In the phase space of dimension n + 1, we define the initial point Xq and a 
straight line tt parallel to the axis x^ (i.e. the criterion), passing through the 
final point Xf. The optimal control is, among the admissible controls such that 

^This notation is that of Pontryaguine et al. (1974). The superscript corresponds to the 
rank i of the coordinate while the subscripts (0 and 1) or (0 and /), according to the authors, 
are reserved for the terminal conditions. 
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the solution x{t), having as the initial condition xq^ intersects the line tt, the 
one which minimizes the coordinate at the intersection point with tt. 

We introduce the costate variables ^ such that 






J=0 



i = 0, ...,n (14.120) 



This system admits a unique solution ip composed of piecewise continuous 
functions, corresponding to the control u and presenting the same discontinuity 
points. 

We then consider the Hamiltonian to be equal to the scalar product of 
functions p and / 

n 

H{tp,x,u) =tp'^ f = i=0,...,n (14.121) 

^=0 



The systems can be written again in the Hamilton canonical form 



dx^ _ dH 

dt dpi 
dp, dH 

dt dx'^ 



i = 0, . . . , n 
i = 0, . . . , n 



(14.122) 



When the solutions x and p are hxed, the Hamiltonian depends only on the 
admissible control n, hence the notation 



M{p^x) = H{p^x^u) (14.123) 

ueu 

in order to mean that A4 is the maximum of H at hxed x and '0, or further 

H{p\xpu^) > H{p\xpu^ + 5u) ^5u (14.124) 

We consider the admissible controls, dehned on [toT/]? fo be responding to 
the previous dehnition: the trajectory x(t) issued from xo at to intersects the 
straight line tt at tf. According to Pontryaguine et al. (1974), the hrst theorem 
of the Maximum Principle is expressed as: 

So that the control u(t) and the trajectory x(t) is optimal, it is necessary that 
the continuous and nonzero vector, p{t) = [po{t),pi{t), . . . ,pn{t)] satisfying 
Hamilton canonical system (14.122), is such that: 

1. The Hamiltonian H[p{t), x{t), '^( 0 ] reaches its maximum at point u = u{t) 
Vt G [to^tf], thus 

x{t), u{t)] = x{t)] (14.125) 

2. At the end-time t/, the relations 

popf) < 0 ; M[ppf), xpf)] = 0 (14.126) 

are satished. 

With Eq. (14.122) and condition (14.125) being verihed, the time functions 
pop) and Ad[pp),xp)] are constant. In this case, the relation (14.126) is 
verihed at any instant t included between to and tf. 
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14.4.7 Singular Arcs 



In optimal control problems, it often occurs for some time intervals that the 
Maximum Principle does not give an explicit relation between the control and 
the state and costate variable: this is a singular optimal control problem which 
yields singular arcs. 

Following Lamnabhi-Lagarrigue (1987), an extremal control has a singular 
arc [a, b] in [to^tf] if and only if , a:^*, n*) = 0 and ^ a:^*, n*) = 0, 

for all t G [a, b] and whatever ^0* satisfies the Maximum Principle. 

Example 14.2: Linear Quadratic Control using Hamilton- Jacobi and 
Pontryagin Methods 

The previous example is considered again, and, first treated in the framework 
of Hamilton- Jacobi equations and then in the framework of the Maximum Prin- 
ciple. 

Minimize the performance index 

J=- f {u^ Xi) dt (14.127) 

2 Jo 

given 

±1 = /i = axi + bu (14.128) 

Assume that no constraint exists on the control u. 

a) In the context of Hamilton- Jacobi equations, the Hamiltonian would be equal 
to 

H = —F + /i = — ^ + xf) + Jji {axi + bu) (14.129) 

The Hamilton canonical conditions are 



X =H^ 
^ =-Hx 



( ±i =ax\Fbu with: xi(J)) = xi^ 



dG 



ipi =Xi—aJji with: — =0 

dx f 

(14.130) 

Maximize H with respect to the control u. As the control u is not constrained, 
it results that 

BH 

= —u FbJji =0 (14.131) 



Bu 

hence the optimal control 

u = bi/ji 

From the previous equations, we draw 

xi — (a^ + b‘^)xi = 0 



(14.132) 

(14.133) 



The initial state xi(0) (terminal condition) is given and the final adjoint va- 
riable '0(t/) (transversality condition) is also known. Thus, this is again a 
two-boundary problem with the same numerical issue previously stressed in 
the same example treated by Euler conditions. 
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b) In the context of Pontryagin Maximum Principle, define the coordinate xq 
( corresponding to of Eq. (14.116) and here denoted in subscript to avoid 
confusion with the powers in superscript) such that 

*0 = /o = ^ {X' + xj) (14.134) 

The Hamiltonian, defined according to the Maximum Principle is equal to 

= "00 /o + V'l /i = ^ "00 {u'^ + Xi) + V'l {axi + bu) (14.135) 

The Hamilton canonical equations are 




= 1 + xl) 

= = axi + bu with: xi(0) = Xi^ 

= 0 





dH 


±0 


dtpo 




dH 


Xl 


dipi 


i’o 


dH 


dxo 


V'l 


dH 


dxi 



dG 



= —xboXi—xbia with: '0i(tf) = — ^ — =0 

ox f 

(14.136) 

The first two equations of this system are the state equations. By setting ci as 
a constant (equal to —1 according to the Hamiltonian given by Eq. (14.129)), 
from the two following equations, we draw 



■>Po{t) = Cl 

'ipi = —ipi a — CiXi 



(14.137) 



We notice that which concerns the criterion, is constant as already men- 

tioned in the general theory. Maximize H with respect to the control u. As 
the control u is not constrained, it gives 



dH 

du 






(14.138) 



hence the optimal control 



u = -- 



b 

Cl 



(14.139) 



By using the state Eq. (14.128) and Eq. (14.137), we obtain the same differen- 
tial equation as (14.88) 

X — -\- b‘^)x = 0 (14.140) 



Indeed, in the framework of dynamic optimization, Eqs. (14.136) will be inte- 
grated with the terminal conditions, including the condition at the fixed ini- 
tial state xi(0) = xiq and the transversality condition at the adjoint variable 
= 0 at the final time. Of course, the numerical difficulties previously 
evoked concerning Example 14.1 treated by the Euler equations are encoun- 
tered here. 
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Example 14.3: Mimimum-time Problem with Constraints on the 
Control 

Consider the following classical example of mechanics: a system reduced to a 
point is described by its position y, its velocity v and its acceleration u] the 
latter is thus the input which governs the system position. Although this model 
is outside the traditional scope of chemical engineering, it has many merits. Es- 
sentially, it is very simple so that the different cases studied provide analytical 
solutions which can be easily understood. The transposition to the chemical 
engineering field is not necessarily immediate but, for example, the problems 
of minimum time are very similar. Also, the influence of constraints appears 
clearly and it illustrates perfectly how the Hamilton- Jacobi method is used. 

The physical system is described by the following linear second-order model 

2/(2) = u (14.141) 

By setting: x\ = y and X 2 = ^, we get the equivalent state-space model 



Xi = X2 
±2 = U 



(14.142) 



We wish to go from the state (0, 0) to the state (A, 0) in a minimum time. 
This corresponds to a variation in amplitude A on the output: y = x\^ while 
imposing that its derivative remains zero when t ^ 0 and t ^ tf. We impose 
that the control must stay in the interval [—1, 1]. 

The performance index is thus 

rtf 

J= dt (14.143) 

Jo 

We treat this problem with Hamilton- Jacobi equations, and the Hamiltonian 
is equal to 

H = — 1 + ^01 X2 + V^2 (14.144) 

We obtain 

H can be represented with respect to the bounded control ix, and we notice 
that, with this control being linear with respect to xx, the maximum of H is 
reached when u is equal to —1 or +1 according to the sign of ^02 (Fig- 14.1). 

It results from Fig. 14.1 that 



u = sign(V^ 2 ) 



(14.146) 



hence 



= -1 + X2 + |'02| 



(14.147) 
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(a): V ^2 < 0 (b): V ^2 > 0 

Figure 14.1: Variation of the Hamiltonian with respect to the bounded control 
(|ix| < 1) and the sign of ^02 



On the other hand, the Hamilton- Jacobi Eq. (14.111) is 

Jt - H{x\u\x, -Jx, t), -Jx, t) = 0 (14.148) 

with the criterion J dehned by Eq. (14.106), which verihes 

rtf 

J{x"^,t)= J dr Jt = -1 and: Jx = ^ (14.149) 

The Hamilton- Jacobi equation results in 

H{x\u^{x, -Jxj). -Jx. 0 = -1 ^ ^2 + IV^2| = 0 (14.150) 

Erom the differential equations describing the variation of '0, we deduce 



= Cl 

V^2 = -Cl tJC2 



(14.151) 



where ci and C 2 are constant. As ^02 depends linearly on t, ^02 crosses the value 
0 at the maximum only once at the commutation instant tc, if the latter exists. 
The control u thus can take only two values at most, and each value only once 

if t G [0, tc], '^ = if ; ift^]tcjf],u = ^l (14.152) 

This type of control is called bang-bang. Let uq = We obtain the solutions 
of the trajectories 



lit e [0,tc],u = Uo, < 2 

X2 = Uot 

, j xi = — ^ {t — tf)'^ + A 

lit e]tc,tf],u= -Uo, < 2^ 

y X 2 = -Uo {t - tf) 



(14.153) 



Note that xi{t) is continuous at the commutation time tc, so that 
xi(t-) = Xi(t+) 



Mtc ) = Y^c 

Mtt) = -y(C + ^ 



(14.154) 
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which gives 

(tc-tff +tl = 2— (14.155) 

Uo 

This equation implies that the right-hand term is positive, so that 

uq = sign(A) (14.156) 



Equation (14.155) becomes 

(tc — tf) = 2|A| > tf = tc~\- \/2| A| — (14.157) 

As the criterion is J = tf, it suffices to minimize the expression of tf given by 
Eq. (14.157) with respect to tc to obtain the commutation time 



= 0 ^ t. = ypT 

cite 

In summary, we obtain the three unknowns tc, tf, uq as 

Uo = sign(A) 




tf = 2,/\A\ w 2.828 



(14.158) 



(14.159) 
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Figure 14.2: Variation of the input 

The input presents the bang-bang aspect of Eig. 14.2. The commutation 
instant is indicated by an arrow. The phase portrait (Eig. 14.3) shows the two 
arcs and the output trajectory effectively presents a horizontal tangent at the 
beginning and the end. 

This type of problem would correspond to a car driver wanting to go from 
one point to another in minimum time over a short distance. He first accelerates 
at the maximum, then brakes at the maximum. In chemical engineering, when 
the minimum time is searched for an operation, often the solution is that a 
valve is fully open, then fully closed. 
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Figure 14.3: Left: phase portrait, right: trajectory of the output (chosen 
value of the hnal state xi: A = 2) 



Example 14.4: Miminimum-time Problem with Constraints on the 
Control and on the State 

Consider the same example as previous but further assume that the state X 2 
can be bounded, which amounts to a limit for x\ \n its rate for reaching the 
hnal state, thus 

\x2\<v < 0 (14.160) 

Taking into account the constraint, the Hamiltonian can be written as 

H = -1 + X2 + V^2 ^ + /^i(^2 “ (14.161) 



where /ii is a Kuhn-Tucker parameter. It results 

/ii = 0 if X 2 — < 0 (inside the domain, with respect to \x 2 \ < v) 

/ii < 0 if X 2 — = 0 (on the constraint \x 2 \ = v) 

(14.162) 

The condition /ii < 0 comes from the fact that H must be a maximum. We 
obtain 

^01 = 0 

^2 = -^1 - 2111X2 



(14.163) 



At most, two commutation (or junction) instants td and tc 2 will exist such 
that 



Ai — 



I Wo I 



A2 — t f tcl 



(14.164) 



Note that the hnal time tf is unknown and will be determined by the hnal 
condition on the state x\. 

r _ ^2 

at e [ 0 ,tci],u = Uo, < 2 

I X2 = Uot 



tcl . 



iite]tcl,tc2],U = 0, J -^0 2 (2^ ^cl) 

[ X2 = Uo tcl 

lit e]tc2,tf],U = -Uq, I 

\ X2 = -Wo it - tf) 



(14.165) 
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We deduce 



I Wo I 



A 



tf = l^(A+-) = - sign(«o) + ^ 

UqV Uo V | mo | 



(14.166) 



This final time is equal to 3.12 in the case where the state X 2 is bounded by 
0.9 and is larger than the final time without constraint on the state. 

The time interval [tciTc 2 ] corresponds to a singular arc and is called a 
singular time interval. 

The input (Fig. 14.4) shows first its saturation, then the singular part, then 
again a saturation with an opposite value. Commutation instants are indicated 
by arrows. The phase portrait (Fig. 14.5) displays the two commutation times 
and the saturation for X 2 = 0.9. We notice on the trajectory of the output xi 
that during the saturation of X 2 (rate of xi), the input u being constant, xi 
increases linearly. 
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Figure 14.4: Variation of the input 




Figure 14.5: Left: phase portrait, right: trajectory of the output (chosen 
value for the final state xi: A = 2 and the state X 2 is bounded: \x 2 \ < 0.9) 
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14.4.8 Numerical Issues 

In general, the dynamic optimization problem results in a set of two systems 
of first-order ordinary differential equations 



X with: x{to) = Xq 

'0 = with: 



(14.167) 



where to and tf are initial and hnal time respectively. Thus, it is a two-point 
boundary-value problem. A criterion J is to be minimized with respect to a 
control vector. In general, in particular for nonlinear problems, there is no 
analytical solution. 

Different numerical techniques (Bryson, 1999; Latih et ah, 1998) can be 
used to hud the optimal control: 

Boundary Condition Iteration: 

First, the adjoint vector is initialized and the system (14.167) concerning both 
X and ip is integrated. The control vector u(t) results from the maximization 
of the Hamiltonian with possible constraints on u. The resulting values of ip at 
hnal time tf are compared with the required values From this comparison, 
new values of the adjoint vector are deduced and the process repeated until 
convergence. 

Multiple Shooting: 

This method is very similar to boundary condition iteration except that inter- 
mediate points are used in [toCf] fo decompose the problem into a series of 
boundary condition iteration problems. 

Quasi- linearization: 

The system (14.167) is linearized around a reference trajectory. 

Invariant Embedding: 

The initial two-point boundary-value problem is transformed into an initial 
value problem which is now a system of partial differential equations. 

Control Veetor Iteration: 

Initially, a control vector is assumed. Then, the state equations are integrated 
forwards in time and then the adjoint equations backwards in time. A gradient 
method can be used to estimate a new control vector as 



Unew{t) = Uold{t) + a— (14.168) 

au 

The choice of the magnitude of displacement a in the gradient direction can 
be performed by a line search method (Fletcher, 1991) or Rosen gradient pro- 
jection method(Soeterboek, 1992). In some minimum-time problems such as 
batch styrene polymerization (Farber and Laurence, 1986), a method based 
on coordinate transformation can be used, which simply needs that one of the 
state variables be monotonic such as in general conversion (Kwon and Evans, 
1975). 

Control Veetor Parametrization: 

In this method (Goh and Teo, 1988; Teo et ah, 1991), the control vector is 
approximated by basis functions. The control prohle can be chosen in different 
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ways: piecewise constant, piecewise linear or piecewise polynomial, so that the 
control is expressed as 

u{t) = '^ai 4>i{t) (14.169) 

where cj)(t) are adequate basis functions, which can be Lagrange polynomials. 
Thus the optimization is realized with respect to parameters a^, which govern 
the control profile. 

This method has been used by Fikar et al. (2000) to determine the optimal 
inputs during the transition from one couple of set points to another for a con- 
tinuous industrial distillation column. A full model of the distillation column 
was used by the authors. 

Collocation on Finite Elements or Control and State Parameterization: 

Both state and control variables are approximated by basis functions such as 
Lagrange polynomials (Biegler, 1984; Cuthrell and Biegler, 1987). The final 
problem is solved by nonlinear programming. 

Consider the following general optimization problem in the form 



(ix, q)* = arg{ min J} 

u(t),q 



with the criterion to be minimized 



J = 







$[x(t), u{t), q, t]dt 



(14.170) 



(14.171) 



subject to the constraints 



g[x{t),u{t),q,t] < 0 
h[x{t)^ u{t)^ qC] = ^ 

x{t) = f[x{t),u{t),qC] 
< x{to) = Xo 

^inf ^ x{tS) Cl Xgup 
'^inf Cl u{td) C '^sup 
^ Qinf C q C qsup 



instantaneous inequality constraints 

instantaneous equality constraints 

state model 

initial conditions 

state domain 

control domain 

parameter domain 



(14.172) 



where u is the control variables vector, q is the control parameters vector, J is 
the performance criterion. 

Assume that the time domain is divided into subdomains. In this case, an 
orthogonal collocation is realized at each subdomain = [ce^, ce^+i], called a finite 
element Ace^, which is bounded by two knots: ai and whose positions are 
a priori unknown. With these finite elements, the state and control variables 
are approximated by Lagrange polynomials of orders nc and nc—1 respectively 



Xnca) = 

1=0 

nc 

J = 1 



nc 

^ija)= n (~ — for i = l,---,ne 

Tjd — Tjh 

k=0,^j 

(14.173) 

nc 

= n (— — for i = l,...,ne 
k=Sj r-ife 

(14.174) 
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where ne is the number of hnite elements. The dimensionless time, r, allows 
to treat easily free end-time problems. 

The Tij are dehned as 

Tij = - ai) ; i = l,...,ne ; j = 0, ...,nc (14.175) 

where 70 = 0 and the jj ( j = 1 , . . . , nc) are the zeros of a Legendre polynomial 
dehned on [ 0 , 1 ]. 

Replacing the state and control variables by their polynomial approxima- 
tions in the state system leads to the following algebraic residual equations 



nc 

^{ni) = - tf.f[au,bu,q,Tu] =0 ; I = 1, ... ,nc ; i = 1, ... ,ne 

j=0 

(14.176) 

where 

^iAni) = (14.177) 



Then, it is necessary to impose the continuity of the state variables between 
two successive hnite elements and to bound the extrapolation of the control 
variables at both ends of the hnite elements, since they are only dehned inside 
each element. 

Finally, the problem (14.170) can be approximated as follows 



{au,hii,q,ai,tf)* = min J[au,hu,q,tf\\ (14.178) 

au,bii,q,ai,tf 



subject to 



g[ail,bii,q,ru] < 0 
h[aii,bii,q,Tii] = 0 

r{ru) = 0 
aio = Xo 

= E^=o = 1) 

'^inf ^ — ^sup 

'^inf ^ — ^sup 

Xinf ^ btii ^ Xgup 
'^inf ^ ^ '^sup 

^ Qinf ^ Q ^ Qsup 



(14.179) 



The optimization problem parameters are the state and control variables val- 
ues at the collocation points dehned by Eq. (14.175), the hnal time, and the 
position of the knots, which correspond at convergence to the control variable 
discontinuities. The state and control variables are then completely dehned by 
Eqs. (14.173) and (14.174). This method allows us to easily handle all types 
of constraints. 

The resulting nonlinear problem (14.178) can then be solved using a succes- 
sive quadratic programming technique, such as that developed by Schittkowski 
(1985). 
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The drawback of this technique is that a large number of parameters to be 
optimized are generated. The advantage is that it does not require the use of 
Hamiltonian and adjoint equations. However, this method has been successfully 
used by Centric et al. (1999) to calculate off-line the optimal temperature profile 
of a batch emulsion copolymerization reactor. 

Iterative Dynamie Programming: 

The system is simply described by the dynamic model 

x = f{x,u) (14.180) 

where the initial state vector x^ is given. A performance index J{x(tf)) is to 
be minimized, the final time tf being specified. The optimal control policy is 
searched by stage to stage optimization, where stages correspond to time subin- 
tervals. No gradients are necessary and no auxiliary variables are introduced. 
The general principle (Luus, 1996) is the following: 

1. Divide the total time interval [to, tf] into P subintervals [0, ti], . . . , t^], 

etc. of equal length L. 

2 . Choose an initial estimate of control u for each stage, giving the control pol- 
icy vector u of dimension P. The control belongs to an initial domain denoted 
by Vin^ Choose the region contraction factor 7 and the region restoration factor 

T]. 

3. Choose the number of iterations jmax for each pass and the number of passes 

^max* 

4. Set pass index q = 1 and iteration index j = 1. 

5. Set = 77 ^ Tin- 

6 . Integrate dynamic model (14.180) to get the state vector x{k — 1) at each 
stage k. 

7. From that stage, the principle of dynamic programming is really involved. 
Beginning at stage P, integrate model (14.180) from tf — L to tf for each of 
the R allowable values of continuous control vector given by 

u{t) = u{k) [u{k 1) — u{k)]{t — tk) / L , V/c (14.181) 



with 

u{P-l) =u*{P-lY ^^ , u{P) =u*{pY^^ +02^^^ (U.182) 

where the superscript * stands for the best value of the previous iteration and 
Di and D 2 are diagonal matrices of different random numbers between 0 and 
1 . The control values should satisfy the constraints Umin ^ u < Umax- The 
control values that give the best performance index are retained as u{P — 1) 
and u{P). 

8 . Proceeding backwards, continue the procedure of step 7 until initial time 
t = 0. 

9. Reduce the region for allowable control 

fpij ^ 77 * ^ 



(14.183) 
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using the best control policy coming from step 8 (denoted by *) as the mean 
value for the allowable control. 

10. Set j = j -\- 1 and go to step 7. Continue until j < jmax- 

11. Set q = I and go to step 5. Continue until q < g^max- 

Variants of this method are given in the following: Banga and Carrasco (1998); 
Bojkov and Luus (1996); Carrasco and Banga (1997); Luus and Hennessy 
(1999); Mekarapiruk and Luus (1997). Examples of applications to batch che- 
mical reactors are often cited in these articles. 



14.5 Dynamic Programming 

14.5.1 Classical Dynamic Programming 

Dynamic programming (Bellman, 1957; Bellman and Dreyfus, 1962) has found 
many applications in chemical engineering (Aris, 1961; Roberts, 1964), in par- 
ticular for economic optimization problems in rehneries, and was frequently 
developed in the 1960s. Among the typical examples, are the optimization of 
discontinuous reactors or reactors in series, catalyst replacement or regener- 
ation, the optimization of the counter-current extraction process (Aris et ah, 
1960), the optimal temperature prohle of a tubular chemical reactor (Aris, 
1960), the optimization of a cracking reaction (Roberts and Laspe, 1961). 

In a different domain, a famous problem is the traveller who, having to 
go from one point to another, must optimize his travel, which includes the 
possibility of going through different towns. 

Optimality Principle (Bellman, 1957): 

A policy is optimal if and only if whatever the initial state and the initial 
decision, the decisions remaining to be taken constitute an optimal policy with 
respect to the state resulting from the first decision. 

Because of the principle of continuity, the optimal final value of the criterion 
is entirely determined by the initial condition and the number of stages. In fact, 
it is possible to start from any stage, even from the last one. For this reason, 
Kaufmann and Cruon (1965) express the optimality principle in the following 
manner: 

A policy is optimal if, at a given time, whatever the previous decisions, the 
decisions remaining to be taken constitute an optimal policy with respect to the 
result of the previous decisions, 
or further. 

Any sub-policy (from Xi to Xj ) extracted from an optimal policy (from Xq to 
xn) is itself optimal from Xi to Xj. 

At first, dynamic programming is discussed in the absence of constraints, 
which could be terminal constraints, constraints at any time (amplitude con- 
straints) on the state x or on the control u, or inequality constraints. Moreover, 
we assume the absence of discontinuities. 

In fact, as this is a numerical and not analytical solution, these particular 
cases previously mentioned would pose no problem and could be automatically 
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considered. 

In continuous form, the problem is the following: 

Consider the state equation 

X = f{x, u) with: a:^(0) = Xo (14.184) 

and the performance index to be minimized 

rtf 

J{u)= / r{x^u)dt (14.185) 

Jo 

where r represents an income or revenue. 

In discrete form, the problem becomes: 

Consider the state equation 



^n+l — /(^n? (14.186) 

with At = tn-\-i — tn- The control Un brings the system from the state x^ to 
the state Xn+i and results in an elementary income r{xn^Un) (integrating, in 
fact, the control period At, which will be omitted in the following). 

According to the performance index in the integral form, define the perfor- 
mance index or total income at instant N (depending on the initial state Xo 
and the policy followed from 0 to A — 1, bringing from the state Xo to 

the state xn) as the sum of the elementary incomes r{xi^ Ui) 



A^-l 

Jo = r{xi,Ui) (14.187) 

^=0 

The values of the initial and final states are known 



x{to) = Xo ] x{tN) = XN (14.188) 

If the initial instant is n, note the performance index J^. 

The problem is to find the optimal policy constituted by the suc- 

cession of controls ix* (x = 0, . . . , A — 1) minimizing the performance index Jq. 
We define the optimal performance index J"^{xo, 0) as 



A^-l 

J*(a :^0 5 0) = niin Jo = min r{xi^Ui) (14.189) 

Ui Ui ^ ^ 

^=0 

This performance index bears on the totality of the N stages and depends on 
the starting point Xq. In fact, the optimality principle can be applied from any 
instant n, to which corresponds the optimal performance index J*(a:^^,n) 
From the optimality principle, the following recurrent algorithm of search 
of the optimal policy is derived 



J*{Xn, n) = min[r(a;„, it„) + J*(a;„ + f{Xn, itn), n + 1)] 

Un 



(14.190) 
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which allows us to calculate the series J*(xn^ n)^ J*(xn-i^ n — 1), . . . , 0) 

from the hnal state x^. 

If the hnal state is free, choose A^) = 0. In the case where it is 

constrained, the last input is calculated so as to satisfy the constraint. 

The algorithm (14.190) could be written as 

J*{Xn,n) = minu„[r (Xn,Un) + [r(a;„+i, + J* {Xn+2,n + 2)]] 

= min„„ [r{Xn, Un) + [r(a;„+i, + ...]] 

(14.191) 

However, a difficulty resides frequently in the formulation of a given problem 
in an adequate form for the solving by means of dynamic programming and, 
with the actual progress of numerical calculation and nonlinear constrained 
optimization methods, the latter are nowadays employed more. A variant of 
dynamic programming (Luss, 1990) called iterative dynamic programming can 
often provide good results with a lighter computational effort (Luss, 1993; Luss 
and Bojkov, 1994). 

Example 14.5: Application of Dynamic Programming 

Consider again a particular form of example (14.84) in discrete form. The 
stable system of dimension 1 is then defined by the following discrete-time 
state equation 

Xk+i = f{xj u) = 0.5 Xk + Uk (14.192) 

and the discrete performance index 

3 3 

J = ^ r(xfe, Uk) = ^(4 + 4) (14.193) 

/c=0 /c=0 

Terminal constraints are given 



Xq = 0 
X4 = 1 



(14.194) 



In fact, this is a single-input single-output case of discrete linear quadratic 
control. This problem has been solved using Maple© by symbolic computation. 

We therefore wish to find the control Uk such that the sum of the costs rk up 
to time k is minimum when the state goes from Xk to At each instant n, 

we will calculate the control Un such that the performance index is minimized 
in recurrent form as 



J*(xn, n) = min[r(xn, Un) + J*(xn+i, n + 1)] (14.195) 

Un 

beginning from the last instant N . 

The elementary income is equal to 

r(xk,Uk) = xl+ul (14.196) 



The process consists of four stages, thus N = A. Set: a = 0.5. 
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First stage: 

The final state is fixed: X4 = 1. It results that 



X 4 = axs -\- Us = 1 (14.197) 

which, when joined with the expression (14.192) of the state xs 

xs = ax2 F U2 (14.198) 

gives the optimal control 1x3 

Us = l — a{ax2 + U 2 ) (14.199) 

which depends on the value of the state X 2 and the control 1x2 • 

Second stage: 

The performance index is transformed by using the expression of xs from Eq. 
(14.192) and the expression (14.199) of the optimal control 1x3 

J (xo, 0 ) = (xq + xxq) + (^1 + '^1) + (^2 + '^2) + (^3 + '^3) 

= (xq + xxq) + {x\ -\- xx^) + {x‘^ -\- XX2) + (nx2 + '^2)^ T [1 — a(^ax 2 + '^2)]^ 
= (Xq + XXq) + (x^ + xx^) + J (x2, 2) 

(14.200) 

Note that at this instant, the performance index is composed of two parts, one 
depending on instants from 0 to 1, the other one J(x2,2), which is essential 
depending on the instant 2, thus on the state X 2 and the control xx2- The 
performance index J(x2, 2) must be minimum with respect to xx2, which gives 
the optimal control xxj 



dJ{x2,2) ^ ^ 

du2 

2 xx2 + 2 {ax 2 + XX2) — 2a[l — a{ax 2 + XX2)] = 0 
^ a{x2-lFoSx2) 



2 + a2 



= -0.278x2 + 0.222 



(14.201) 



Third stage: 

The following is given numerically because of the complexity of analytical ex- 
pressions. The performance index J(xq, 0) is transformed by using the expres- 
sion of X 2 from Eq. (14.192) and the expression (14.201) of the optimal control 
xxj 

J (xq, 0 ) = (xq + xxq) + (x^ + xx^) + (0.5xi + xxi)^ + ( — 0.139xi — 0.278xxi + 0.222) 
+ (O.lllxi + 0.222 mi + 0.222)^ + (0.889 - 0.556xi - O.lllui)^ 

= (Xq + Uq) + J (xi, 1 ) 

(14.202) 

The performance index J(xi, 1) must be minimum with respect to xxi, which 
gives the optimal control u\ 



dJ(xi, 1) 
dui 



ul = -0.266x1 + 0.0519 



(14.203) 
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Fourth stage: 

Finally, the performance index is transformed by using the expression of xi 
from Eq. (14.192) and the expression (14.203) of the optimal control ul 

J (xq, O) = -\- r^g) ~\~ (0.5xq ~\~ '^o)^ 4“ ( — 0.133xo — 0.266ixo 4“ 0.0519) 

+ (0.117x0 + 0.234mo + O.OSlof + (-0.0325xq - 0.0649uo + 0.20Sf 
+ (0.0260x0 + O.OSWuo + 0.234)^ + (0.883 - 0.0130xq - 0.0260uo)^ 

(14.204) 

This performance index must be minimum with respect to uq, which gives the 
optimal control Uq 

dJ(xo, 0 ) ^ Q ^ ^ -0.266x0 + 0.0122 (14.205) 

duQ 



Table 14.1: Succession of the optimal states and controls obtained by dynamic 
programming 



Instant k 


State Xk 


Control xx^ 


0 


0 


0.01218 


1 


0.01218 


0.04871 


2 


0.05479 


0.20700 


3 


0.23440 


0.88280 


4 


1 





By knowing the value of the original state xg, the succession of inputs and 
thus of the states is easily calculated in Table 14.1. 



14.5.2 Hamilton- Jacobi-Bellman Equation 

Given the initial state Xq at time tg, considering the state x and the control ix, 
the optimal trajectory corresponds to the couple (x^u) such that 



J*{xo,to) = mmJ{xo,u,to) 

u{t) 



(14.206) 



thus the optimal criterion does not depend on the control u. 

In an interval [t, t + At], the Bellman optimality principle as given in the 
recurrent Eq. (14.190) can be formulated as 



J*(x(t),t) =mml f 

u(t) 



t+At 



r{x^ XX, r)dr + J"^{x{t + At), t + At) > (14.207) 
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This can be expressed in continuous form as a Taylor series expansion in the 
neighbourhood of the state x{t) and time t 



dJ* 

r{x, u, t) At + t) + -j— At+ 

J*(a;(t),t) =min<^ / qj*\T 



) (14.208) 



Taking the limit when At ^ 0 results in the Hamilton- Jacobi-Bellman equation 






(14.209) 



As the optimal criterion does not depend on control ix, it yields J*(a:^(t/), t/) = 
W{x{tf)), which gives the boundary condition for the Hamilton- Jacobi-Bellman 
Eq. (14.209) 

r{x,tf) = W{x) , Mx (14.210) 

The solution of Eq. (14.209) is the optimal control law 

B 7 * 

u^=g{ — ^x,t) (14.211) 

which, when introduced into Eq. (14.209) gives 

-— = r(:i., t) + (^— j fix, g, t) (14.212) 

whose solution is J*(a:^, t) subject to the boundary condition (14.210). Equation 
(14.212) should be compared to the Hamilton- Jacobi Eq. (14.60). Then, the 
gradient BJ"" jdx should be calculated and returned in (14.211), which gives 
the optimal state-feedback control law 

dJ* 

u" = = h{x, t) (14.213) 

This corresponds to a closed-loop optimal control law. 



14.6 Linear Quadratic Control 

Among the numerous publications concerning linear optimal control, are, in 
particular, the books by Anderson and Moore (1971, 1990); Athans and Ealb 
(1966); Bryson and Ho (1975); Grimble and Johnson (1988a, b); Kirk (1970); 
Kwakernaak and Sivan (1972); Lewis (1986), and more recently, in robust con- 
trol Maciejowski (1989). Eurthermore, among reference papers, cite Kalman 
(1960, 1963); Kalman and Bucy (1961). Linear quadratic control is presented 
here in the previously discussed general framework of optimal control. 
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14.6.1 Continuous-Time Linear Quadratic Control 



In continuous time, the system is represented in the state space by the deter- 
ministic linear model 



r x{t) = Ax{t) -\- B u{t) 

I y(t) = Cx{t) 



(14.214) 



where u is the control vector of dimension riu^ x the state vector of dimension 
n and y the output vector of dimension Uy. C ^ are matrices of respective 

sizes n X n^n X riu^ riy x n. 

The control u must minimize the classical quadratic criterion 

rtf 

J = 0.5 x"^ (tf) Q x(tf) 0.5 / [x^^ (t) Q x(t) (t) Ru(t)]dt (14.215) 

Jto 

where matrices Qf , Q, are symmetrical semipositive dehnite, whereas R is 
symmetrical positive dehnite. This criterion tends to bring the state x towards 
0. Other criteria have been derived from the original criterion by replacing the 
state by z with z = Mx^ where z represents a linear combination of the 
states, e.g. a measurement, or the output if M = C . The problem is then 
the regulation of z. It is possible to incorporate the tracking of a reference 
trajectory = Mx^ by replacing the state x with the tracking error — z). 

Thus, the most general criterion can be considered, which takes into account 
the different previous cases 

J =Q.b{z'^ - - z){tf) 

r^f nr nr (14.216) 

+0.5 / [{z^ — z)^ (t) Q {z^ — z){t) (t) Ru{t)]dt 

Jto 

or 

J = 0.5 {x^ - xY(tf) Qf M {x^ - x){tf) 

+0.5 f [{x'^ — x)"^ (t) Q M {x'^ — x){t) (t) Ru{t)]dt 
Jto 

(14.217) 

The matrices Qf,Q,R must have dimensions adapted to the retained criterion. 

According to the techniques of variational calculation applied to optimal 
control, introduce the Hamiltonian as in Eq. (14.98) 

H = -Q.b[{x^ -xf ft) Q M fx'^ -x){f)+u^ ft) Ruft)]+'ipft)'^ [Ax+B u] 

(14.218) 

The Hamilton canonical Eqs. (14.99) provide, besides the state Eq. (14.214), 
the derivative of the costate vector 

=^ft) = -M'^ QMfx'' -x) - (14.219) 

with the final transversality condition (assuming that the state Xf is free, that 
is, not constrained) 

-ifftf) = M'^ QfM fx'' -x)f 



(14.220) 
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Moreover, in the absence of constraints, the condition of maximization of the 
Hamiltonian with respect to u gives 

dH rj. 

= 0 = -Ru + i) (14.221) 

ou 

hence the optimal control 

u{t) = R~^ -ipit) (14.222) 



Note that if a different definition of the Hamiltonian is used with a positive sign 
before the functional, the previous formula results in a negative sign and the 
non-diagonal terms of the first square matrix of Eq. (14.223) will have opposite 
signs as in Eq. (14.235) defining the Hamiltonian matrix. It would suffice to 
change ip in —pj. 

Gathering all these results, the system to be solved becomes a two-point 
boundary- value problem 



'x{t) ' 









A 


BR-^B^ 


x(t) 




M^QM 









0 0 
M^QM 0 



x{to) = Xo 

'Ipitf) = M'^ Qf M {x^ - x)f 



x^{t) 

0 



(14.223) 



Regulation Case: = 0 

A classical approach consists of introducing the transition matrix corresponding 
to the previous differential system, which can be partitioned such that 



x{t) 




^xx{T,t) ^x^{T,t) 




x{t) 


_ V’('t) _ 











re[t,tf] (14.224) 



This 

thus 



equation can be used at the final instant r = tf (where '0(t/) is known) 



^(^/) — ^ xx(t f 1 0 ^(0 ^ X'lpip f 1 0 "^(0 

— ^'ipxip f •) 0 ^(0 



(14.225) 



hence 



t) + Qf t)] x(t) 



(14.226) 



a relation which can be denoted by 



-ipit) = -M^ S(t) Mx(t) = -Pc(t) x(t) (14.227) 



both to express the proportionality and to verify the terminal condition (at t/), 
M being any constant matrix. The subscript c of Pc means that we are treating 
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the control problem (to be compared with Pf later used for Kalman hltering). 
The relation (14.227) is also called a backward sweep solution (Bryson, 1999). 
This relation joined to the optimal control expression gives 

u*(t) = -R~^ S(t) M x(t) = -R~^ Pc(t) x(t) (14.228) 

which shows that this is a state feedback control. By setting M = J, we hnd 
the classical formula which equals Pc{t) and S{t) 

u*(t) = -R-^ B^ S(t) x(t) (14.229) 

In fact, the matrix Pc{t) can be calculated directly. Use the relation 



= -Pc{t)x{t) (14.230) 

inside the system (14.223). The continuous differential Riccati equation results 

P^(t) = -Pc(t) A- Pc(t) + Pc(t) BR-^ B^ Pc(t) - Q M 

with: Pc{tf) = Qj^ M 

(14.231) 

where the matrix Pc{t) is symmetrical semipositive definite. Knowing the 
solution of this differential equation, the optimal control law can be calculated 

u*(t) = -R~^ B^ Pc(t) x(t) = -Kc(t) x(t) (14.232) 



Notice that the differential Riccati equation (14.231), being known by its final 
condition, can be integrated backwards to deduce Pc{to), which will allow us 
to exploit the optimal control law in relation to the differential system (14.214). 
If the horizon tf is infinite, the control law is 

u*(t) = -R-'- B'^ Pcx(t) (14.233) 



where the matrix Pc is the solution of the algebraic Riccati equation 

PcA + A^ Pc - PcB R~^ B^ Pc + Q M = 0 (14.234) 



which is the steady-state form of the differential Riccati Eq. (14.231). 

Different methods have been published to solve Eq. (14.234), which poses 
serious numerical problems. A solution can be to integrate backwards the 
differential Riccati equation (14.231) until a stationary solution is obtained. 
Another solution, which is numerically robust and is based on a Schur de- 
composition method, is proposed by Arnold and Laub (1984); Laub (1979). 
Consider the Hamiltonian^ matrix Ti 



n = 



A -BR-^B^ 

-M^QM -A^ 



(14.235) 



matrix A of dimension (2n x 2n) is called Hamiltonian if J~^ J = —A or J = 
—A~^ J A, where J is equal to: / Q ' 

An important property (Laub, 1979) of Hamiltonian matrices is that if A is an eigenvalue 
of a Hamiltonian matrix, —A is also an eigenvalue with the same multiplicity. 
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whose eigenvalues and eigenvectors are sought. Order the matrix U of the 
eigenvectors of the Hamiltonian matrix of dimension 2n x 2n in such a way that 
the first n columns are the eigenvectors corresponding to the stable eigenvalues 
(negative real or complex with a negative real part), in the form 



Un Ui2 

U21 U22 



(14.236) 



where the blocks U ij have dimension n x n. The solution to the algebraic 
Riccati equation (14.234) is then 

Pc = U21 ay (14.237) 



Note that the stationary solution to this problem can also be obtained by 
iterative methods (Arnold and Laub, 1984). 



Tracking Case: ^ 0 

In the presence of the tracking term , the differential system (14.223) is not 
homogeneous anymore, and it is necessary to add a term to (14.224), as 



x{t) 




^ xx { T , t ) 




x(t) 


1 


' 9xi'rA) 


■0(t) _ 












_ _ 



(14.238) 



In the same manner as previously, at the final instant r = t/, we obtain 



^(^/) — 0 ^(0 9 X^ f 0 

t) x{t) + t)tp{t) + g^{tf, t) 

hence 

I t) + Qf t)] x(t) + 

- Xrj) +g^{tf,t)^ 

giving an expression in the form 

= -Pcit) x{t) + 8{t) 



(14.239) 



(14.240) 



(14.241) 



By introducing this expression in (14.219), we again get the differential Riccati 
Eq. (14.231) whose matrix Pc is a solution, with the same terminal condition. 
Moreover, we obtain the differential equation giving the vector s 

s{t) = [Pc{t) - A^] s(t) - QMx^ 

with: s{tf) = Q M x^j: 



(14.242) 
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This equation is often termed feedforward. Knowing the solutions of this diffe- 
rential equation and of the differential Riccati Eq. (14.231), the optimal control 
law can be calculated and applied 

u*(t) = -R-^ Pc(t) x(t) + R-^ s(t) = Ufb(t) + Uff(t) (14.243) 

In this form, Ufb{t) represents a state feedback control (term in x(t)), as the 
gain Kc depends at each instant on the solution of the Riccati equation, and 
Uff{t) represents a feedforward control (term in s{t)). This structure is visible 
in Fig. 14.6. 




Figure 14.6: Structure of linear quadratic control 

In the same manner as in regulation, when the horizon is infinite. Pc is 
solution of the algebraic Riccati equation (14.234) and s is solution of the 
algebraic equation 



s = [PcBR-^ B^ - A^]-^ QMx^ (14.244) 



This solution is frequently adopted in actual practice. 

Lin (1994) recommends choosing, as a first approach, diagonal criterion 
weighting matrices with their diagonal terms equal to 

= ^* = l/K)Lx (14.245) 



in order to realize a compromise between the input variations and the perfor- 
mance with respect to the output, while Anderson and Moore (1990) propose 
taking 



Qi = 






zfdt 



Ti = 



T 



ujdt 



(14.246) 



It frequently happens, as in the simple linear example previously treated, that 
some components of the control vector are bounded 



In this case, the optimal control is equal to 

u* =sat(i^■^B^V’) 



(14.247) 



(14.248) 
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defining the saturation function by 



sat(ix^) 



if* ^ '^'i,max 

if* — ^i,ma:x. 



(14.249) 



Example 14.6: Linear Quadratic Control of an Extractive Distillation 
Column 

Gilles and Retzbach (1983); Gilles et al. (1980) studied an industrial extractive 



Isopropanol 




Figure 14.7: Extractive distillation column of Gilles, with concentration and 
temperature profiles 

distillation column (Fig. 14.7) designed to separate water and isopropanol from 
the feed using glycol as an extracting agent, introduced near the top of the 
column. The product is purified isopropanol at the top of the column, while 
glycol leaves at the bottom. The reduced-order model developed by Gilles 
describes the concentration and temperature profiles in the column. At two 
places denoted by and stiff variations of concentration and temperature 
occur; at zi, the interfacial separation between water and isopropanol happens; 
at tfie interfacial separation between water and glycol happens, hence the 
allure of the profiles (Fig. 14.7). The position of these concentration and 
temperature fronts depends on the feed flow rate, the heating vapour flow 
rate and the side vapour flow rate. The feed composition and flow rate are 
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disturbances, while the heating vapour flow rate and the side vapour flow rate 
are the manipulated variables. 

Denoting by S the variations with respect to the steady state, the boiler 
dynamics is represented by the differential equations 



SQi = ail ^Qi + bii Sui 
5Vi = U21 5Qi + U22 



with the following notation: 

SQi’. heat flow rate at the boiler, 

5Vi'. vapour flow rate, 

5ui'. heating vapour flow rate. 

The variations of the fronts are represented by 

5zi = bs2 + fsi Sxfai + fs2 ^Fa 
5z2 = ^42 5S + /42 5Fa 



(14.250) 



(14.251) 



with: 

5S: side vapour flow rate, 

Sxfai'- feed composition, 

5Fa'- feed flow rate. 

The position of the fronts is determined by measuring the temperature by 
means of thermocouples located in the neighbourhood of the desired positions 
of the fronts according to a linear law 



5Ti = ci3 5zi 
ST2 = C24 ^Z 2 



(14.252) 



The continuous state-space model of the process is then 

X = Ax B u F d 
y = Cx 



(14.253) 



where d represents the disturbances and y the measurements. The vectors are 
dehned with respect to the physical variables by 



X = 



SQi 

SVi 

Szi 

SZ2 



u = 



Sui 

SS 



d = 



Sxfai 

SFa 



y = 



STi 

ST 2 



(14.254) 



A is a 4 X 4 matrix, S4x2, F4x2, C2x4. The time unit is hours and 
temperature unit is degrees Kelvin. Numerical data are the following (all other 
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elements of the matrices are zero) 



ail 


= -30.3 


611 = 6.15 X 10® 


hi 


= 62.2 


Cl3 = 


-7.3 


Ct 21 


= 0.12 X 10-3 


632 = 3.04 


fs 2 


= 5.76 


C24 = 


-25.0 


Ct 22 


= - 6.02 


642 = 0.052 


/42 


= 5.12 







^32 = -3.77 
(242 = — 2.80 



(14.255) 



In order to emphasize the influence of the weighting matrices of the qua- 
dratic index (14.216), linear quadratic control is employed for different values 
of the weighting matrices Q and R. Qj is chosen as zero and M is equal to C . 
Two set point step variations have been realized for each output. The set point 
is perfectly followed (Figs. 14.8 to 14.10). However, note that there is neither 
noise, nor disturbance. The responsiveness of the tracking increases when the 
weighting with respect to the control decreases, leading to large variations of 
the inputs, which could possibly saturate. The coupling of the multivariable 
system appears clearly at t = 2h and t = 4h, when the set point variations 
concern only one output, respectively yi and ^ 2 - 

If all the states are considered to be known and M is the identity matrix, 
the optimized LQ criterion is given by Eq. (14.215). In the case where Q and 
R are chosen as identity matrices, the solution of the algebraic Riccati equation 
is then 



1.626 X 10-® 


7.388 X 10“^ 


2.778 X 10-3 


-1.626 X 10-® 


7.388 X 10“^ 


3.786 X 10+3 


1.424 X 10+2 


-8.332 X 10+3 


2.778 X 10-s 


1.424 X 10+2 


5.694 X lO+o 


-3.137 X 10+2 


-1.626 X 10-® 


-8.332 X 10+3 


-3.137 X 10+2 


1.834 X 10+'* 



(14.256) 

and the corresponding steady-state gain 

_ r 9.999 X 10-1 4.544 x IQ-i 1.709 x 10-^ -9.999 x lO”! ' 

^ -8.221 X 10-11 -4.213 x lO-i 9.999 x lO-i 1.709 x 10-^ 

(14.257) 

These results concerning the algebraic Riccati equation have been obtained by 
the Hamiltonian method. Note that some well-known control codes may give 
erroneous solutions for this solving. This system is close to being uncontrollable. 

If only the measured states xs and X 4 are considered, M is equal to C and 
the optimized LQ criterion is given by Eq. (14.216). Again, in the case where 
Q and R are chosen as identity matrices, the solution of the algebraic Riccati 
equation is then 



3.292 X 10-12 


8.531 X lO-i" 


-3.269 X lO-i" 


-1.593 X 10-® 


8.531 X lO-i" 


2.224 X 10-1 


-9.472 X 10-2 


-4.169 X 10-1 


-3.269 X lO-i" 


-9.472 X 10-2 


3.237 X 10-1 


-8.621 X 10-2 


-1.593 X 10-® 


-4.169 X 10-1 


-8.621 X 10-2 


1.174 X lO+o 



(14.258) 
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and the corresponding steady-state gain 



2.025 X 10-® 5.247 x IQ-^ -2.011 x lO"! -9.796 x IQ-^ ' 

-1.077 X 10-® -3.096 x lO"! 9.796 x lO"! -2.010 x IQ-^ 

(14.259) 




Time Ch) 




Time Ch) 



Figure 14.8: Linear quadratic control of Gilles extractive distillation column 
(criterion weighting: Q = R = I). Top: input ui and output y\. Bottom: 
input U 2 and output y 2 
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14.6.2 Linear Quadratic Gaussian Control 

In linear quadratic control, such as was previously discussed, the states are 
assumed to be perfectly known. In fact, this is seldom the case. Indeed, often 
the states have no physical reality and, if they have one, frequently they are 
not measurable or unmeasured. Thus, it is necessary to estimate the states to 
use their estimation in the control model. 

In continuous time, the system is represented in the state space by the 
stochastic linear model 



( x{t) = Ax{t) + Bu{t) +w{t) (^Aom^ 

X y{t)=Cx{t)+v{t) 

where w{t) and v{t) are uncorrelated Gaussian white noises, respectively of 
state and measurement (or output), of the respective covariance matrices 

E {ww^} = PE > 0 , E {vv^} = V > 0 (14.261) 

Denote by x the state estimation, so that the state reconstruction error is 
e{t) = X — X. An optimal complete observer such as 

X = A x{t) + B u{t) + K f{t)[y{t) — C x{t)] (14.262) 

minimizes the covariance matrix of the state reconstruction error, thus 

E {{x — x)Pw{x — x)^} (14.263) 

where is a weighting matrix (possibly, the identity matrix). 

Kalman and Bucy (1961) solved this problem and showed that the estimator 
gain matrix K f is equal to 



Kf{t)=Pf{t)C^V-^ (14.264) 

where Pf{t) is the solution of the continuous differential Riccati equation 
Pf{t) =APf{t)+Pf{t)A^-Pf{t)C^V-^CPf{t) + W 

(14.265) 

with: Pf{to) = Pq 

Moreover, the initial estimator condition is 



x{to) = xo (14.266) 

The Kalman-Bucy filter thus calculated is the best state estimator or ob- 
server in the sense of linear least squares. It must be noticed that the de- 
termination of the Kalman filter is a dual problem of the linear quadratic 
optimal control problem: to go from the control problem to the estimation 
one, it suffices to make the following correspondences: A B 

M^QM ^ W, R ^ V, Pc Pf ] on the one hand, the control Riccati 
equation progresses backwards with respect to time, on the other hand, the 
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estimation Riccati equation progresses forwards with respect to time. This lat- 
ter remark obliges us to carefully manipulate all time-depending functions of 
the solutions of the Riccati equations (Kwakernaak and Sivan, 1972): Pc{t) 
(control problem) is equal to P f {to tf — t) (estimation problem), where to is 

the initial time of the estimation problem and tf the final time of the control 
problem. 

When the estimation horizon becomes very large, in general the solution of 
the Riccati Eq. (14.265) tends towards a steady-state value, corresponding to 
the solution of the following algebraic Riccati equation 

APf+PfA^-PfC’^V-^CPf + W=0 (14.267) 

giving the steady-state gain matrix of the estimator 

Kf = PfC'^V-^ (14.268) 

Kwakernaak and Sivan (1972) detail the conditions of convergence. For reasons 
of duality, the solving of the algebraic Riccati equation (14.267) is completely 
similar to that of Eq. (14.234). 




Figure 14.11: Structure of linear quadratic Gaussian control 

Consider the general case of tracking. The control law similar to (14.243) 
is now based on the state estimation 

u*(t) = -R-^ Pc(t) x(t) + R-^ s(t) = -Kcx{t) + Uff(t) (14.269) 

The system state equation can be written as 

x{t) = A x{t) — B Kc{t) x{t) A Buff{t) Aw{t) (14.270) 

so that the complete scheme of the Kalman filter and state feedback optimal 
control (Fig. 14.11) is written as 



X 




A -BKc 




X 


+ 


' Buff{t)+w 


X 




KfC A- KfC - B Kc 




X 


_ Buffit) + KfV _ 



(14.271) 
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which can be transformed by use of the estimation error e{t) = x{t) — x{t) 



X 

e 

The closed-loop eigenvalues are the union of the eigenvalues of the state feed- 
back optimal control scheme and the eigenvalues of Kalman hlter. Thus, it is 
possible to separately determine the observer and the state feedback optimal 
control law, which constitutes the separation principle of linear quadratic Gaus- 
sian control. This property that we have just verihed for a complete observer 
is also verihed for a reduced observer. 

It is useful to notice that the Kalman hlter gain is proportional to P (which 
will vary, but must be initialized) and inversely proportional to the measure- 
ment covariance matrix Y. Thus, if Y is low, the hlter gain will be very large, 
as the conhdence in the measurement will be large; the risk of low robustness 
is then high. The Kalman hlter can strongly deteriorate the stability margins 
(Doyle, 1978). The characteristic matrices of the Kalman hlter can also be 
considered as tuning parameters. It is also possible to introduce an integrator 
per input-output channel in order to effectively realize the set point tracking; 
the modelled system represents, in this case, the group of the process plus the 
integrators. It is possible to add an output feedback (Lin, 1994) according to 
Fig. 14.12, which improves the robustness of regulation and tracking. 



A-BKc 

0 



BKc 

A-KfC 



X 

e 



+ 



Buff(t)+w 
w — KfV 



(14.272) 




Figure 14.12: Structure of linear quadratic Gaussian control with added out- 
put feedback 

An important development in LQG control is the taking into account of 
robustness so as to satisfy frequency criteria concerning the sensitivity and 
complementary sensitivity functions. Actually, the stability margins of LQG 
control may reveal themselves to be insufficient. LQG/LTR introducing loop 
transfer recovery (recovery of the properties at the process input) (Stein and 
Athans, 1987), (Maciejowski, 1989) allows us to solve this very important type 
of problem in state-space multivariable control. 
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Example 14.7: Linear Quadratic Gaussian Control of an Extractive 

Distillation 

Column 

Again, we consider the control of the Gilles extractive distillation column, pre- 
viously treated in the case where all the states are known. Now, suppose that 
the noise w{t) bearing on the state derivatives is Gaussian and has a standard 
deviation equal to 0.2, thus constituting a model error. The measurement noise 
is Gaussian and has a standard deviation equal to 0.5 . The initial states are 
all taken to be equal to 0.1 . 

Gonsider the following values of the criterion weighting: Q = J ; R = I 
and the covariance matrices: V = 0.04 J; W = 0.25 J. Thus, the solution of 
the algebraic Riccati equation obtained from the Hamiltonian matrix method 
and concerning the Kalman estimator is 



Pf = 

giving 



4.125 X 10-3 


1.339 X 10-3 


-1.008 X 10-9 


-3.999 X 10-19 - 


1.339 X 10-3 


1.912 X 10-2 


-2.818 X 10-3 


-7.681 X 10-1 


-1.008 X 10-9 


-2.818 X 10-3 


1.426 X 10-2 


1.315 X 10-1 


-3.999 X lO-io 


-7.681 X 10-* 


1.315 X 10-1 


4.034 X 10-3 








(14.273) 


le steady-state gain matrix of the 


Kalman estimator 




■ 1.839 X 10-^ 


2.499 X lO-i" 






5.143 X 10-1 


4.801 X 10-1 




= 


-2.603 X lO+o 


-8.218 X 10-2 


(14.274) 




-2.399 X 10-2 


-2.521 X 10+9 





The values of the weighting matrices of the criterion and the covariance 
matrices are indicated in each case in the legends of Figs. 14.13 to 14.16. 
Figures 14.13 and 14.14 correspond to a state feedback control where no special 
account is taken of the measured output, while in Figs. 14.15 and 14.16, an 
output feedback acting on the reference, as indicated in Fig. 14.12, has been 
used. Naturally, the latter type of control gives better set point tracking. 
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Figure 14.13: Linear quadratic Gaussian control of Gilles extractive distil- 
lation column (criterion weighting: Q = J; R = I. Initial Pf{0) = 0.1 J. 
Govariance matrices: V = 0.04/ ; W = 0.25/). Top: input ui and output y\. 
Bottom: input U 2 and output y 2 




Time Cti> 




Time 



Figure 14.14: Linear quadratic Gaussian control of Gilles extractive distilla- 
tion column (criterion weighting: Q = / ; = 0.1/. Initial Pf{0) = 0.1/. 

Govariance matrices: V = 0.04/ ; W = 0.25/). Top: input ui and output y\. 
Bottom: input U 2 and output y 2 
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Time Ch) 




Figure 14.15: Linear quadratic Gaussian control of Gilles extractive distilla- 
tion column, with output feedback (criterion weighting: Q = J ; R = I. Initial 
P/(0) = 0.1 J. Govariance matrices: V = 0.04J; W = 0.25 J). Top: input 
ui and output y\. Bottom: input U 2 and output y 2 




Time Ch) 




Figure 14.16: Linear quadratic Gaussian control of Gilles extractive distilla- 
tion column, with output feedback (criterion weighting: Q = J; P = 0.1 J. 
Initial P/(0) = 0.1 J . Govariance matrices: V = 0.04J; W = 0.25 J). Top: 
input u\ and output y\. Bottom: input U 2 and output y 2 
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14.6.3 Discrete-Time Linear Quadratic Control 



In discrete time, the system is represented in the state space by the determin- 
istic linear model 



ccfe+i = Fxk + Guk 
Vk = Hxk 



(14.275) 



The sought control must minimize a quadratic criterion similar to Eq. 
(14.216) thus 



J — 0.5 [z^ — zpf]'^ Q [z^ — zpf] 



N-l 



-F0.5 {[zl~ Zkf Q [zl - Zk] +ulRuk] 

k=0 

= 0.5 - xn]^ QnM [x^j^ - xn] 

A^-l 

+0.5 E{K - Xk]'^ QM[xl~ Xk] + ul itMfej 

k=0 



(14.276) 



where the matrices Q are semipositive dehnite and R is positive dehnite. 
Furthermore z = Mx represents measurements or outputs. It would be pos- 
sible to use variational methods to deduce from them the optimal control law 
(Borne et ah, 1990; Lewis, 1986), which would provide a system perfectly sim- 
ilar to Eq. (14.223). However, variational methods are a priori designed in the 
framework of continuous variables, thus for continuous time; on the other hand, 
dynamic programming is perfectly adapted to the discrete case. Thus, we will 
sketch out the reasoning in this framework. For more details, it is possible, for 
example, to refer to Dorato and Levis (1971); Foulard et al. (1987). 

The system is considered at any instant i included in the interval [0,A^], 
assuming that the policy preceding that instant, thus the sequence of the 
{uk^k G [i + l,A^]}, is optimal (the hnal instant N is the starting point for 
performing the procedure of dynamic programming). In these conditions, the 
criterion of interest is in the form 

Ji — 0-5 [z^ — Ztv]^ Q n ~ ^n\ 

A^-l 

-F0.5 y; { [4 - Zkf Q [zl - Zk] + ul Ruk} 

k=i 

A^-1 

= 0.5 [xlf - xn]'^ Qjv M [xlf -xn]+'^ Lk{xk, Uk) 

k=i 

with the revenue function Lk dehned by 

Lk{xk,Uk) = 0.5 I [4 - Xkf M'^ Q M [xl - Xk] + + 

According to the Bellman optimality principle, the optimal value 
criterion can be expressed in a recurrent form 

J* =nun [o.5[xl -Xif QM[xl -Xi]+uf Rui + 



(14.277) 

(14.278) 
J* of the 

(14.279) 
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The final state is assumed to be free. It is necessary to know the expression of 
Jij^i with respect to the state to be able to perform the minimization. Reason- 
ing by recurrence, suppose that it is in quadratic form 

J*+i = 0.5 {a;^i Si+i Xi+i Xi+i + hi+i] (14.280) 

hence 



— 0.5 {[F Xi G SiJ^i [F Xi G Ui 
+2 [F Xi -\- G Ui] + hi-^i } 



We deduce 



J* = 0.5 minui - Xi]"^ Q M [xl - Xi] -^uf Rui 

^[Fx, + Gu,f ^,+1 [Fx, ^Gu,] (14.282) 

+2 Qi_^i [F Xi -\- G Ui] + hi-^i } 



We search the minimum with respect to Ui thus 



R < + G^ [Fx,^G <] + G^ = 0 (14.283) 



or 

< = ~[R+G'^ Si+iG]-^[G'^ Si+iFxi + G'^gi^^] (14.284) 

provided that the matrix [R+G'^ Si+\ G] is invertible. Notice that the optimal 
control is in the form 

u; = -K,x,^k,g,^^ (14.285) 

revealing the state feedback with the gain matrix Ki and feedforward with the 
gain ki. Thus, we set 

K, = [i^ + G^ ^,+1 G] - 1 ^,+1 F ; Sn = M 

rj^ 1 rj^ (14.286) 

ki =—\R^G (j] G 

It is then possible to verify that J* is effectively in quadratic form; thus we 
find 

Si = QM + Si+i{F -GKi) 

^ ^ (14.287) 

Qi =-M^ Qz\ + {F^ -GKif Qi+^- gj, = MQj,z-j, 

The group of Eqs. (14.285), (14.286), (14.287) allows us to determine the inputs 
u. When not all states are known, of course it is necessary to use a discrete 
Kalman filter which will work with the optimal control law according to the 
same separation principle as in the continuous case. 

It can be shown that Eq. (14.287) is equivalent to the discrete differential 
Riccati equation 

Si = {F-G Kif Si+i {F -GKi)+Kj RKi + Q M (14.288) 
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presented in Joseph form and better adapted to numerical calculation. 

Let us apply the Hamilton- Jacobi principle to the discrete-time optimal 
regulator. If the control law n* and the corresponding states are optimal, 
according to the Hamilton- Jacobi principle, there exists a costate vector '0* 
such that n* is the value of the control Ui which maximizes the Hamiltonian 
function Ha 



Ha — 5 '^i) 4“ i-\-l ^'i+l 

= -Li{x*,Ui) +'ip*h [Fxi +Gui] 

= -0.5 { [< -XiY M^QM [< -Xi] + uJ Rui'^ + [Fxi + G m] 

(14.289) 

The Hamilton- Jacobi conditions give 
BH 

ri=-^=M^QM[x* -x^] -F^ lA.+i with: ^^=M^QM [x% -x^^] 

(14.290) 

and the control which maximizes the Hamiltonian function H„ is such that 



dH 

= Q^ -Rui+G'^ rAi+i = Q^u*=R-^ G^ rAi+i 



hence 



Xi+i = Fxi + Gui = Fxi + GR ^ G^ i/’i+i 



Introduce the matrix H such that 

= n 



Xi 



^i+i j 



^i+i j 



= H~^ 



Xi 



(14.291) 

(14.292) 

(14.293) 



Assume x^ = 0 for the regulation case. The two conditions (14.290) and 

(14.291) can be grouped as 



=M^QMx* 

Xi+i=Fxi+GR 



from which we deduce the matrix H 

^—1 F~^ G R~^ 

^ ^ Q M F~^ -F’^ - Q M F~^ G R~^ G^ 



(14.294) 



(14.295) 



In the case where a steady-state gain K^o is satisfactory, which can be 
realized when the horizon N is large, the gain matrix can be obtained after 
solving the algebraic Riccati equation 

S = F^[S-SG{G^ SG + R)-^G^ S]F + MQM (14.296) 

whose solution (corresponding to discrete time) is obtained in a parallel manner 
to the continuous case, by hrst considering the matrix Ti. Its inverse is the 
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symplectic^ matrix 1~L ^ equal to 






F + G R-^ F~^ QM -G R~^ G^ F~^ 

_ F~^ M^QM -F“^ 



(14.297) 



We seek the eigenvalues and associated eigenvectors of 7i~^. Then, we form 
the matrix U of the eigenvectors so that the hrst n columns correspond to the 
stable eigenvalues (inside the unit circle), in the form 



Un Ui2 
U 21 U 22 



(14.298) 



where the blocks U ij have dimension nxn. The solution of the discrete Riccati 
algebraic equation (14.296) is then 

Soo = U21 ny (14.299) 



giving the steady-state matrix. 

For the tracking problem, in parallel to the stationary solution for the gain 
matrix iT, the stationary solution for the feedforward gain is deduced from Eq. 
(14.287) and is given by 

gi = [I- (F^ - G l-M^ Q <] (14.300) 

The use of stationary gains provides a sub-optimal solution but is more 
robust than using the optimal gains coming from Eqs. (14.286) and (14.287). 

The discrete linear quadratic Gaussian control is derived from the previously 
described discrete linear quadratic control by coupling a discrete linear Kalman 
hlter in order to estimate the states. 

Remark 

The operating conditions of quadratic control have not been discussed. In 
general, it is assumed that the pair (A, B) in continuous time, or (E", G) in 
discrete time, is controllable. Moreover, when the horizon is inhnite and when 
we are looking for steady-state solutions of the Riccati equation, the condition 
that the pair (A, \/Q) continuous time, or (E", \/Q) discrete time, is 
observable (the notation H = ^/Q means that Q = H) must be added. 
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Chapter 15 

Generalized Predictive 
Control 



The principle of predictive control makes it attractive for many applications, 
either as linear or nonlinear control. In this chapter, only linear generalized 
predictive control is studied. Model predictive control is treated in Chap. 16 
where linear model predictive control is discussed in two forms and nonlinear 
predictive control is also discussed. The latter (Rawlings et ah, 1994) is often 
treated as an optimization problem with constraints on the state and on the 
input, where we seek a control minimizing a technical-economic criterion. 



15.1 Interest in Generalized Predictive Control 

Generalized predictive control (GPC) proposed by Clarke et al. (1987a,b) is 
among the control methods that are usable for adaptive control. To be used, 
it must be coupled with an identihcation method of the process model; if the 
identihcation is realized on-line, it is adaptive control. Bitmead et al. (1990) 
devoted a entire book to GPC and, in particular, studied the interaction be- 
tween parametric identihcation and the choice of control scheme, insisting on 
the interest in combining them for the robustness of the system rather than 
exciting oneself in a very deep aspect either of identihcation or of control, even 
more so as identihcation will have to be realized in closed loop. Bitmead et al. 
(1990) even qualify of synergy the relation between recursive least-squares iden- 
tihcation and linear quadratic control, i.e. the robustness of the combination 
is better than that obtained by applying them separately. Also, the book by 
Camacho and Bordons (1998) deals mainly with GPC. It also presents multiva- 
riable GPC and includes several examples. On the other hand, this method has 
been used successfully in industrial applications in different forms (Clarke, 1988; 
Defaye et ah, 1993; LeLann et ah, 1986; Rahlamanana et ah, 1992). Among 
the claimed advantages of GPC, Clarke et al. (1987a) mention that it can be 
applied to processes presenting a variable time delay, to nonminimum-phase 
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Figure 15.1: Principle of extended horizon predictive control 



processes, and that it poses no apparent problem when the process model in- 
cludes too many parameters, contrary to pole-placement or linear quadratic 
control. 

Predictive control owes its name to the fact that it takes into account pre- 
dictions of the process future outputs and sometimes also of the future inputs. 
The outputs and the inputs are predicted over a finite horizon. 



15.2 Brief Overview of Predictive Control 
Evolution 

Predictive control did not appear suddenly, but rather appeared as an evolution 
through the minimum-variance controller (Astrom, 1970) and the self-tuning 
controller (Astrom and Wittenmark, 1973) obtained by minimization of the 
criterion 

T(ix,t) = E|[^(t + d+ l)— r(t + d+ l)]^} (15.1) 

where r is the reference signal to be tracked by the output and d the delay of the 
output with respect to the input (excluding the minimum unit time delay of the 
output with respect to the input). Minimization of the criterion gives the input 
u{t) to apply. The controller thus obtained is only convenient for minimum- 
phase systems. In order to extend it to nonminimum-phase systems, generalized 
minimum- variance control was proposed by Clarke and Gawthrop (1975, 1979), 
by introducing a penalty on the input, thus giving the new criterion 

J(ix, t) = E {[y{t + d + l) — r(t + d + l)]^ + Xu{t)^} (15.2) 

or furthermore 



J(ix, t) = E { [y{t + d + 1) — r(t + d + 1)]^ + AAix(t)^} 



(15.3) 
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by introducing the input variation Au{t) to ensure a zero steady- state error in 
the case of a constant nonzero reference. Ydstie (1984) modihed this control 
by introducing an extended horizon, applicable to nonminimum-phase systems, 
but not to open-loop unstable systems. The nearest control to GPC is predic- 
tive control by Peter ka (1984) for an inhnite prediction horizon, hence a dif- 
ferent development. Clarke et al. (1987a, b) introduced generalized predictive 
control, which minimizes the following criterion 

{ N2 "I 

Y [y(^+j) - r{t + j)]'^ + j - 1)]'^ I (15.4) 

j=Ni j = i I 



where Ni and N 2 are the minimum and maximum cost horizons and Nu the 
control horizon (Fig. 15.1); these horizons are hnite. \{j) is a weighting se- 
quence for the input. In fact, only the hrst calculated input u{t) is applied 
and the following inputs u{t + 1), • • • are only calculated in open loop. The 
minimization is thus repeated at the following step. The calculation of the in- 
put necessitates us knowing the future set points, and the aim of the control is, 
owing to the output predictions, to bring the future outputs as close as possible 
to the set points. The most important parameter is the prediction horizon N 2 . 
In general, the control horizon is chosen to be small. 



15.3 Simple Generalized Predictive Control 

15.3.1 Theoretical Presentation 

In this section, the described method is the original method of Clarke et al. 
(1987a,b), restricted to the single-input single-output case. 

The considered system model is the following 

- 1) + (15-5) 

based on the use of the backward shift operator q~^. This is an auto- regressive 
integrated moving- average exogenous input (ARIMAX) model and ^{t) is white 
noise of zero mean. The polynomials are 

A{q~^) = 1 + + . . . + 

B{q =bo+biq ^+... + bntq 
C{q~^) = 1 + + . . . + 

A(^-i) =l-q-^ 

The natural time delay of the output with respect to the input is integrated 
in the model writing; if the process presents a zero time delay d, the hrst d 
elements of are zero. Notice that the polynomials A{q~^) and C{q~^) 

are monic. 

The model (15.5) can also be considered as 

A{q~^) Ay{t) = B{q~^) Au{t - 1) + C{q~^) ^{t) 



(15.7) 
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making use of the variations in the input and the output. 

We seek the inputs u minimizing the criterion (15.4). To solve this opti- 
mization problem, we need a j-step ahead predictor of the output (Favier and 
Dubois, 1990) depending on the past information and on the future input vari- 
ations Au which will be calculated, so that the criterion is optimal. Such a 
predictor is based on the solving of the Diophantine equation 

C{q-^) = Ej{q-^) A{q-^)^{q-^) + q~i Fj{q~^) (15.8) 

where the polynomials Ej and Fj only depend on C{q~^) and on degree 

j. Moreover, the degree of Ej is such that: deg{Ej) = j — 1. Equation (15.8) can 
be seen as the division of C{q~^) by q~^ with quotient Fj{q~^) and remaining 
This polynomial equation can be favourably solved in 
a recurrent manner (Acundeger and Favier, 1993), (Favier and Dubois, 1990). 
It is also possible to use a bank of predictors. 

Using the model (15.7) and Eq. (15.8), we obtain 

y{t ^ ^ E j - 1) E Ej + j) + (0 (15.9) 

which we still can transform by replacing the known terms ^{t) with respect to 
the model (15.5), and again using Eq. (15.8), hence 

y(t + j) = - '^) + Ej ^(t + j) (15.10) 

Noticing that the term ^{t j) is independent of the past information at time 
t, the predictor results 

y(t + j) = W + + J - 1) (15.11) 

We then introduce a second Diophantine equation 

Ej{q-^) B{q-^) = Gj{q-^) C{q~^) + q~^ Tj{q-^) (15.12) 

to separate the past inputs from the future inputs (Bitmead et ah, 1990). This 
is performed in the same way as in Eq. (15.8) with deg{Gj) = j — 1. The 
predictor expression (15.11) becomes 

y(t+j) = + ^ ^u(t - 1) + Gj{q~^) Au(t + j - 1) (15.13) 

which is written as 

y(t+j) = Ejyf(t) + Tjuqt - 1) + Gj{q~^)Au(t+j - 1) (15.14) 

by introducing the variations in the hltered input and the hltered output 



uf{t) =l/G{q-j Au{t) 
yf{t) =l/G{q-jy{t) 



(15.15) 
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It is typical (Camacho and Bordons, 1998) to consider the control sequence 
formed by the sum of a free signal and a forced signal, which will induce the 
free and forced responses respectively. The free signal corresponds to the past 
inputs and is kept constant and equal to its last value at future time instants. 
The forced signal is taken to be equal to zero at past instants and equal to the 
control variations for future instants (Fig. 15.2). 




Figure 15.2: Decomposition of the control sequence into the sum of a free 
signal and a forced signal and decomposition of the resulting response into free 
and forced responses 



From Eq. (15.14), results the prediction of the free response y{t-\-j\t) obtained 
by assuming that all the future input variations will be zero 

y(t+j\t) = FjvHt) + - 1) (15.16) 

SO that the predictor is decomposed into two parts 

y(t+j) = y(t + j\t) + Au(t + j - 1) (15.17) 



The polynomials Gj and F^ can be calculated in a recurrent manner from Eq. 
(15.12). It can also be noticed that the coefficients gi of polynomial Gj{q~^) 
are the hrst j terms of the response to a step of the system model B/{AA), 
thus are the Markov parameters of this transfer function, as indicated by the 
following equation 



B 

~AA 



Gj + q 




-j 



BF. 

AAG 



(15.18) 



obtained from both Diophantine Eqs. (15.8) and (15.12). 

Dehning the vector / of the predictions of the free response 



/ = [y{t + ■ ■ ■ ^yit + (15.19) 

and the vector u of the future input variations 

u = [Au{t), . . Au{t + Nu - 1)]^ 



(15.20) 
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the vector y of predictions defined by 

y = [y(t + l),...,y(t + N 2 )f (15.21) 



is equal to 



y = Gu^ f 



(15.22) 



where the matrix G of dimension N 2 x Nu contains as elements the first j 
coefficients gi of polynomials Gj as 



" do 


0 


... 0 


9 i 


9 o 


... 0 


9n^-i 


9n^-2 


• • • 9 o 


_ 9N2-1 


9N2-2 


• • • 9 N 2 -n^ 



(15.23) 



If the process time delay is larger than the natural delay: d > 1, the first d — 1 
rows of G are zero. It is possible to choose Ni = d to avoid it. Nevertheless, 
Clarke et al. (1987a) notice that GPC provides a stable solution even when the 
first d — 1 rows of G are zero. 

The quadratic criterion to be minimized becomes 



J{u,t) = E |(y - r)^ {y - r) ^ 

= {Gu -\- f — rY' (G n + / — r) + A iY u 



(15.24) 



assuming the series \{j) is constant and setting the reference vector r equal to 



r = [r{t + 1), . . . , r(t + ^^ 2 )]^ (15.25) 



Assuming that no constraints on the input exist, we obtain the vector of 
the future input variations, solution of this problem of minimization of the cost 
criterion 

M= [G^G +AJ]-^G^(r-/) (15.26) 

In fact, only the first input is really applied; thus, we obtain 

u{t) = u{t - 1) + g'^ {r - f) (15.27) 

where g is the hrst row of matrix [G^ G + A J]“^ G^. 

According to Clarke et al. (1987a), the control horizon Nu is a very impor- 
tant parameter. Nevertheless, it is often sufficient to take it to be equal to 1, 
except for complex systems where Nu will have to be chosen to be equal to the 
number of unstable or badly damped poles. To bring more damping, A must 
be increased. GPC can be used for a nonminimum-phase system, even in the 
case where A = 0. 

The relation between GPC and pole-placement control as well as linear 
quadratic control was studied by Clarke et al. (1987a,b) and particularly by 
Bitmead et al. (1990). 
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15.3.2 Numerical Example : Generalized Predictive 
Control of a Chemical Reactor 

Example 15.1: Generalized Predictive Control of a Chemical Reactor 



The studied case concerns the chemical 
reactor described in Sect. 19.2. The 
identihcation was realized by recursive 
least squares, taking Au and Ay as 
components of the observation vector, 
in order to model the process by an 
ARIMAX model of Eq. (15.7). 

Outlet 

The polynomials A and 5, whose coefficients have been simplihed to facilitate 
the numerical demonstration, are equal to 

A{q-^) = l-0.97q~^ ; = 1.2 + 0.58 (15.28) 

while C is equal to 1 . 

In the case where the control horizon and the prediction horizon are 
both equal to 3, we obtain, as a solution of the first Diophantine Eq. (15.8), 
the following polynomials Ej and Fj 

j = l Ei = l Fi = 1.97-0.97^-1 

j = 2 ^2 = 1 + 1.97^-! F 2 = 2.9109 - 1.9109^-1 (15.29) 

j = 3 Ea = 1 + 1. 97^-1 +2.9109^-2 F 3 = 3.8236 - 2.8236^-1 

hence the predictors according to Eq. (15.11) 

y{t + 1) = 1.97y(t) - 0.97y(t - 1) + 1 . 2 Au(t) + 0.58Au(t - 1 ) 
y\t + 2) =2.9109y(t) - 1.9109y(t - 1) 

+1.2Au(t + 1) + 2.944Au(t) + 1.4426Au(t - 1 ) 
y(t + 3) =3.8236y(t) - 2.8236y(t - 1) 

+1.2Au(t + 2 ) + 2.944Au(t + 1) + 4.6357Au(t) + 1.6883Au(t - 1 ) 

(15.30) 

For the second Diophantine Eq. (15.12), the polynomials Gj and Ej are 

j = l Gi = 1.2 El = 0.58 

j = 2 G 2 = 1-2 + 2.944^-1 E 2 = 1.1426 (15.31) 

j = 3 G 3 = 1.2 + 2.944^-1 + 4.6357^-2 E 3 = 1.6883 

The predictors of the free responses are calculated from Eq. (15.16) 

y(t + l|t) = 0.58Au(t - 1) + 1.97y(t) - 0.97y(t - 1 ) 

y\t + 2 |t) = 1.4426Au(t - 1) + 2.9109y(t) - 1.9109y(f - 1 ) (15.32) 

y\t + 3|t) = 1.6883Au(t - 1) + 3.8236y7) “ 2.8236y(f - 1 ) 
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The matrix G is equal to 



"1.2 0 0 
G = 2.944 1.2 0 

4.6357 2.944 1. 



and the gain matrix of the control law (for A = 0.1) 



\G'^G + \I]-^G'^ 



0.5181 0.1823 -0.0435 

-1.0888 0.1876 0.1823 

0.6262 -1.0888 0.5181 



(15.33) 



(15.34) 



so that the control law in an explicit form is 

Art(f) = 0.5181[r(t+l)-y(t + l|t)]+0.1823[r(t + 2)-y(t + 2|t)] 

-0.0435[r(t + 3) -y(t + 3|t)] 

= -0.490Art(f - 1) - 1.385y(f) + 0.7281y(f - 1) 

+0.5181r(f + 1) + 0.1823r(f + 2) - 0.0435r(f + 3) 

(15.35) 

The polynomials R, S and T corresponding to the pole-placement equivalent 
of this control (Fig. 15.3) are, approximately 



i? = 1.385 - 0.728g-i ; S' = 1 -f 0.4354^“ ^ 

T = 0.5181^-2 0.1823^-1 - 0.0435 

The simulation results are given and commented on in Sect. 15.6. 



(15.36) 



15.3.3 GPC seen as a Pole- Placement 

As only the hrst row of matrix \G"^ G + \I\~^G"^ is used for the effective control 
u{t), the elements of this row are particularized and will be denoted by §i in 
this section (i = 1, . . . , V 2 ). From Eq. (15.26), we deduce the control variation 






- 1) + r{t + .) - 



(15.37) 



which we can transform into the classical RST form of pole-placement (Fig. 
15.3) 






^9iFi > y{t) 



SAu{t) = G{q-^)Tr{t + N 2 ) - Ry(t) 



(15.38) 
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by setting the polynomials i?, S, T of the pole-placement 



N2 

R ='^SiFi 

i=l 

N2 

s + (15.39) 

i=l 

N2 

T 

i=l 




Figure 15.3: Generalized predictive control considered as pole-placement 



GPC can thus be presented as a particular pole-placement minimizing a crite- 
rion. Using the process model (15.5), we have 

(AAS' + q-^BR)y(t) = BCTr{t + N 2 - 1) + CS^(t) (15.40) 



as on the other hand, by expressing R and S, and using Eq. (15.18), we have 



AAS + q-^BR 



C 



N2 

AA + y ] giq^ ^ {B — AAGi) 

i=l 



(15.41) 



and we obtain 



|aA + {B - AAGi)^y{t) = BTr(t + N2~1) + S^t) 

Acy{t) = BTr{t + N2~1) + 

(15.42) 

which shows that the closed-loop output does not depend on polynomial G. 
The stability condition is that the polynomial Ac has its roots inside the unit 
circle. The influence of parameters A is complex. It is easy to verify 

that the steady-state gain of the closed-loop transfer function is equal to 1. 
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15.4 Generalized Predictive Control with 
Multiple Reference Model 

15.4.1 Theoretical Presentation 



This variant of GPC was introduced by Irving et al. (1986). It introduces a 
different specification for the regulation dynamics and the tracking dynamics, 
and thus allows us to improve the robustness. 

On the other hand, the reference model is given by the equation 

= S'-''*’ 

where yc is the set point, i.e the input of the reference model, the 

stable monic polynomial fixing the tracking dynamics and T a scalar ensuring 
a unit gain. 

On the other hand, define Ur{t) as the control which would give an output 
y{t) equal to the reference r{t) in the absence of noise. From the system transfer 
function /A{q~^), we deduce 

= + (15.44) 

Then, we introduce the respective deviations of the input and the output 
with respect to their reference 



e'^(t) = A{u{t) — 
eV{t) = y(t) - r(t) 



(15.45) 



These deviations are filtered by a stable monic polynomial Af{q ^), which 
modifies the regulation dynamics 



(t) = Af{q ^)e'^(t) 
(t) = Af{q~^)ey (t) 



(15.46) 



The optimization criterion (15.4) is replaced by a criterion depending on 
the filtered deviations 



) N2 I 

^ [e^yt+j)]'^+'^\{j)[e^yt+j-l)]'^ i (15.47) 

j=Ni j = i ) 

with: e“-^(t +j) = 0 (i = V 2 ). The method is completely parallel to simple 

GPC where the process model is replaced by the following performance model 

AAe^f = - 1) + AfC^{t) (15.48) 

The developed equations will be very close. The two Diophantine equations to 
solve are 

C{q-^) = E,{q-^) A{q~^) A{q~^) + q~^ F,{q~^) 

Ej{q-^) B{q-^) = Gj{q-^) Af{q~^) C{q~^) + q~i Tj{q-^) 



(15.49) 
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The predictions are expressed with respect to deviations 

+ - 1) + Gj{q~^) e^f{t+j - 1) (15.50) 

AfC AfC 

giving the predictions of the deviations of the free responses 

e^Ht+j) = - 1) (15-51) 

The following vectors are then dehned 

/ = [eyf[t + l\t),...,eyf{t + N2\t)]'^ 

= [e“l(t), . . . , e“l(t + AT„ - 1)]^ (15.52) 

ev = [eyf{t + l),...,eyf{t + N2)f 

These vectors are related by the relation 

ey = Ge^^f (15.53) 

and the quadratic criterion to minimize is 

J = ev'^ev + (15.54) 

hence the solution vector 

= -[G^G + A (15.55) 

whose hrst component only is applied for the real control. With the matrix G 
being the same as for GPC, the elements gi are identical and we obtain 

N2 

e^f{t) = -'^gjiygt+j\t) (15.56) 

J = 1 



which can be expressed as 



N2 

J=1 

SA(u(t) — = 

(t) 

Again. GPC with a reference model can be 
placement minimizing a criterion (Fig. 15.4), 



N2 

-R{y{t) -r{t)) 

-Reyf{t) 

(15.57) 

considered as a particular pole- 
by simply setting 



5^ 



N2 

AfC ^ Tj 

j=i 

N2 

9j 

j=i 



R 



(15.58) 
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Figure 15.4: GPC with performance model seen as a pole-placement 



The closed-loop equation is very similar to Eq. (15.40) and is equal to 

{AAS + q-^BR)y(t) = {AAS + q~^BR)r{t + N 2 - 1) + CS^(t) (15.59) 



thus we can draw 



y{t) 



BT , , CS 

AAS + q-^BR^^^^ 



(15.60) 



which displays the stability condition that the polynomial A/\S + q~^BR has 
its roots inside the unit circle. In fact, after development of R and S, and use 
of both Diophantine equations, this polynomial can be expressed as 



AAS^q-^BR 



N 2 



— AfC [AA + ^2 

j=i 

= AfCA, 



\B-AAGj)] 



(15.61) 



The output is thus 

HR 9 

y{t) = -^yS) + (15.62) 

/±m 

which emphasizes the role played by the polynomial A f acting on the regulation 
dynamics, and by A^ acting on the tracking dynamics. Note that polynomial 
Ac depends, in fact, only on matrix G and parameter A. It is advisable to 
choose the parameters so that the dynamics of polynomial Ac is negligible 
(fast) compared to those of A f and A^. 



15.4.2 Numerical Example: Generalized Predictive 

Control with Performance Model of a Chemical 
Reactor 

Example 15.2: Generalized Predictive Control with Performance Model 
of a Chemical Reactor 
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Again, consider the previously studied 
case, concerning the chemical reactor, 
in order to evaluate the influence of the 
introduction of the performance model 
on GPC. 



The reactor model remains the same with 



A{q-^) = l-Qmq-^ ; = 1.2 + 0.58 (15.63) 

while C is equal to 1. 

We choose the following polynomial Am for the reference trajectory 

Ara{q-^) = l-Q.^2q-^ (15.64) 

SO as to provide a smooth trajectory, whereas the constant T ensures the steady- 
state gain of B{q~^)T/Am{q~^) equal to 1, hence T = A^(l)/5(1). 

The polynomial influencing the regulation is taken to be equal to 

Af{q-^) = l-0.9q-^ (15.65) 

The control horizon Nu and the prediction horizon are both kept equal to 
3. We obtain as a solution of the hrst Diophantine Eq. (15.49) the following 
polynomials Ej and Fj 

j = l Ei = l Fi = 1.07- 0.97^-1 

j = 2 ^2 = 1 + 1.07^-! F2 = 1.1379 - 1.0379^-1 (15.66) 

j = 3 E3 = 1 + 1.07 ^-1 + 1.1379^-2 Fs = 1.2038 - 1.1038^-1 

For the second Diophantine Eq. (15.49), the polynomials Gj (unchanged com- 
pared to the case without performance model) and Tj are the following 

j = 1 Gi = 1.2 Ti = 1.660 

j = 2 G2 = 1.2 + 2.944^-1 T2 = 3.2702 (15.67) 

j = 3 Gs = 1.2 + 2.944^-1 + 4.6357^-^ Tg = 4.8321 

The matrix G is unchanged, thus also the matrix [G'^G + \I]~^G^ . The poly- 
nomials R and S of the equivalent pole-placement of this control (Fig. 15.4) 
are approximately 

A fl j 

i? = 0.709 - 0.644^-1 ; 5^ = 1 + 0.346^-i ; T= 0.101 

(15.68) 

The simulation results are given and commented on in Sect. 15.6. 
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15.5 Partial State Reference Model Control 

(M’Saad et ah, 1990) extended GPC with reference model under the name 
of partial state reference model adaptive control (Corriou, 1996). Many ap- 
plications have been realized (Bendotti and Msaad, 1993), (M’saad et ah, 
1993), including multivariable control (M’saad and Sanchez, 1992), (M’saad 
and Sanchez, 1994). Using a reference model, this technique allows us to sepa- 
rately specify the regulation and tracking dynamics. In the case of a criterion 
with finite horizons, the approach is similar to that of GPC, whereas in the case 
of infinite prediction and control horizons, the proposed controller is based on 
infinite horizon linear quadratic control, which is stable provided that the pro- 
cess model is observable and stabilizable, while the stability of GPC is linked 
to the synthesis parameters. Moreover, with infinite horizons, it is sufficient to 
specify only one parameter against four parameters for GPC. 



15.6 Generalized Predictive Control of a 
Chemical Reactor 

Example 15.3: Generalized Predictive Control of a Chemical Reactor 

The temperature of the chemical reac- 
tor contents is controlled by manipulat- 
ing the position of a three-way valve, 
connected to two heat exchangers that 
are hot and cold respectively, and al- 
lowing action on the temperature of 
the heat-conducting fluid entering the 
Outlet jacket. 

actor model corresponds to the reactor described in Sect. 19.2 and considered 
in both the previous numerical examples. 

In this simulated reactor, GPG is tested for different values of the prediction 
horizon N 2 and the control horizon Ni. The parameter A is maintained equal 
to 0.1. 

Figures 15.5 to 15.7, have been obtained for the same prediction horizon N 2 
and control horizons Nu increasing from 1 to 3. The response quality is slightly 
improved when Nu increases, although it was already very correct for Nu = 1. 
The peak of control u increases at the instants corresponding to the set point 
changes. Figure 15.8 has been obtained for minima horizons N 2 = l,iVu = 1. 
The control input u presents sharper peaks than when A ^2 = 3. 

The disturbance rejection is tested by imposing a step disturbance of am- 
plitude 0.2 between t = 20 s and t = 30 s (Fig. 15.9), with the horizons N 2 = 3 
and Nu = 3. It is clear that the disturbance is efficiently rejected. 

Gaussian white noise of standard deviation 0.2 was added to the output 
(Fig. 15.10). In spite of this noise, the output follows, in average, the set point 
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without problem. 

GPC with different reference models for regulation and tracking has been 
tested. The inffuence of the introduction of these models appears neatly by 
comparing the new Fig. 15.11 to a similar case, but without models (Fig. 
15.7). The input varies more smoothly, while the output follows very regularly 
its set point trajectory, not represented in the figure (because it is superposed). 
Again, in the case with reference models, the inffuence of white noise on the 
output is less sensitive (Fig. 15.12). 




Normalized Time 



Figure 15.5: GPC of the identified linear model of the chemical reactor (A^i = 
1, A ^2 = 3, Nu = !)• Variations in the controlled output and the control input 
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Figure 15.6: GPC of the identified linear model of the chemical reactor {Ni = 
1,^2 = 3, Nu = 2). Variations in the controlled output and the control input 
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Normalized Time 



Figure 15.7: Generalized predictive control of the identified linear model of 
the chemical reactor (A^i = 1,A^2 = 3, = 3). Variations of the controlled 

output and of the control input 
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Figure 15.8: GPC of the identified linear model of the chemical reactor {Ni = 
1,^2 = 1, Nu = !)• Variations in the controlled output and the control input 
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Figure 15.9: GPG of the identified linear model of the chemical reactor {Ni = 
1,^2 = 3, Nu = 3) with step disturbance of amplitude 0.2 between t = 20 and 
t = 30. Variations in the controlled output and the control input 
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Normalized Time 



Figure 15.10: GPC of the identified linear model of the chemical reactor 
(A^i = 1, A ^2 = 3, Nu = 3) with Gaussian white noise on the output of standard 
deviation 0.2. Variations in the controlled output 
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Figure 15.11: GPG of the identified linear model of the chemical reactor 
{Ni = 1,^2 = = 3) with reference model. Variations in the controlled 

output and the control input 
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Figure 15.12: GPG of the identified linear model of the chemical reactor 
(Ni = 1, V 2 = 3, Nu = 3) with reference model and with Gaussian white noise 
on the output of standard deviation 0.2. Variations in the controlled output 
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Chapter 16 

Model Predictive Control 



Model predictive control (MPC) is widely used in the industry, and many refe- 
rences to industrial experience will serve to present the main characteristics of 
this important control approach. Furthermore, the method of dynamic matrix 
control, about which much literature is available, will be discussed with some 
detail. Finally, some general aspects of nonlinear model predictive control will 
be outlined. 



16.1 A General View of Model Predictive 
Control 

A general objective of control schemes is to maintain the controlled variables 
close to their set points while respecting the process operating constraints. 
MPC has been designed to meet those purposes. It was hrst introduced by 
Richalet et al. (1978) (Adersa) as model algorithmic control (MAC) by ID- 
COM (IDentihcation-COMmand), where the accent was placed on the key role 
of digital computation and modelling; several industrial applications were re- 
ported to emphasize the interest of the proposed method. Very soon after, 
dynamic matrix control (DMC) was published (Cutler and Ramaker, 1979) 
and implemented at Shell as a multivariable computer control algorithm. 

Many review papers have been devoted to the evolution of MPC (Garcia 
et ah, 1989; Keyser et ah, 1988; Lee, 1996; Mayne, 1996; Morari and Lee, 1991, 
1999; Muske and Rawlings, 1993; Rawlings et ah, 1994; Ricker, 1991), inclu- 
ding books (Bitmead et ah, 1990; Camacho and Bordons, 1995; Clarke, 1994; 
Maciejowski, 2002; Sanchez and Rodellar, 1996; Soeterboek, 1992) and several 
papers that relate industrial developments and applications of MPC (Froisy, 
1994; Qin and Badgwell, 1996). Soeterboek (1992), in particular, through a uni- 
hed approach compares different types of MPC schemes such as DMC (Cutler 
and Ramaker, 1979), PC A, AC, CPC (Clarke et ah, 1987a, b), EPS AC (ex- 
tended prediction self-adaptive control) (Keyser and Cauwenberghe, 1985) and 
extended horizon adaptive control (EH AC) (Ydstie, 1984). 
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IMC (Garcia and Morari, 1982) studied in different forms in chapters 4, 8 
and 13, can also be considered as a variant of MPC. 

GPC, which is a single-input single-output form of MPG was studied in 
Ghap. 15. 

Infinite horizon control as linear quadratic (LQ) and linear quadratic gaus- 
sian (LQG) control has been studied in Ghap. 14. 




Instant 



Figure 16 . 1 : Principle of model predictive control with prediction and control 
horizons 

MPG can be defined as a class of control algorithms which compute over a 
future time horizon (Fig. 16.1) a sequence of manipulated variables profiles by 
using a linear or nonlinear model of the plant in order to optimize a generally 
quadratic criterion subject to linear or nonlinear constraints. 

Richalet et al. (1978) very clearly explained the main interest in an opti- 
mization control algorithm. They distinguished four hierarchical levels (from 
lower to higher): 

• Level 0: control of ancillary systems (e.g. valves) where PID controllers are 
efficient. 

• Level 1: dynamic control of the plant as a multivariable process perturbed 
by state and structural non-measured perturbations. 

• Level 2: optimization of the set-points with minimization of cost functions, 
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ensuring quality and quantity of production. 

• Level 3: time and space scheduling of production (planning-operation re- 
search) . 

As the economic benehts of levels 0 and 1 are low compared to those of level 
2, effort should be concentrated on level 2 which can be defined as dynamic 
optimization or open-loop optimal control and concerns the calculation of the 
optimal trajectories to be followed. Of course, robustness aspects should be 
taken into account (Genceli and Nikolaou, 1993; Vuthandam et ah, 1995), but 
the main characteristics are well defined. 

Qin and Badgwell (1996) describes the general objectives of MPC in de- 
creasing order of importance: 

1. Prevent violation of input and output constraints. 

2. Drive the manipulated variables towards their steady- state optimal values 
(dynamic input optimization). 

3. Drive the controlled variables towards their steady- state optimal values 
taking into account the remaining degrees of freedom (dynamic output op- 
timization). 

4. Avoid excessive variation of manipulated variables. 

5. When signals and actuators fail, control as much of the plant as possible. 
The commercial codes for MPC adapt themselves in different ways to these 
rules. A fiow chart for MPC calculation is given by Qin and Badgwell (1996) 
which follows the stages: 

• Read the manipulated variables, disturbances and controlled variables of 
the process at time k. 

• Update the outputs by means of feedback. A bias between the current 
measured and the current predicted output is defined: 

bk = y"^-yk ( 16 - 1 ) 

and this bias is added to the output model (16.5) for the following predic- 
tions 

Vk+j = y{xk+j) + bk (16.2) 

This allows us to eliminate steady- state offset. 

• Determine the controlled subprocess or which inputs must be manipulated 
and which outputs must be controlled by examining the degrees of freedom 
(Froisy, 1994): some outputs cannot be controlled if there are not enough 
manipulated variables; the system is square if there are the same number of 
manipulated and controlled variables; when there are more manipulated va- 
riables than controlled variables, it results in extra degrees of freedom, thus 
the optimization yields several solutions and another subsequent optimiza- 
tion can be performed (example of two-stage optimization in IDCOM-M at 
Setpoint and HIECON at Adersa). 

• Remove ill-conditioning measured by the condition number of G where 
G is the process gain matrix. Different approaches are used in commer- 
cial codes, including the use of a threshold for singular values (RMPCT of 
Honeywell) and neglecting smaller singular values, or controlled variables 
controllability ranks defined by the user (IDCOM). 
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• Perform a local steady-state optimization by various means according to 
the codes: use a steady- state nonlinear model of the plant, use a linearized 
model, use the steady-state optimization problem with constraints. 

• Perform the dynamic optimization, which consists of solving the following 
problem^ 



with: J = Y.% + Ef=o 

subject to the nonlinear model constraints 




^k-\-j — — — l) j — 

Vk+j = 9{xk+j) + bk y j = l,Hp 



and the manipulated and controlled variables inequality constraints 



UfYiin ^ ^ '^max j — 0? 1 

^ ^ ^'^max j — 1 (16.6) 

yj,min — yk-\-j — yj,max Vj = l,iJp 



According to model predictive methodology, only the first element Uk of 
the optimal solution is implemented, e is the deviation between the future 
output and the reference trajectory. Q^, Rj^ Sj are semipositive definite 
matrices which serve as tuning parameters to weight the different contribu- 
tions in the objective function. Reference trajectories can also be used to 
avoid violent input variations and implemented in a second optimization. 

• Hierarchization of constraints: hard constraints should never be violated, 
some constraints might be violated and penalized in the objective function. 

• Specification of output and input trajectories. The output trajectory can be 
specified as a set point (which may never be reachable and lead to large input 
moves), a zone (by soft constraints), a reference trajectory (filter allowing 
model mismatch and improving robustness as the time constant increases) 
or a funnel (a kind of min and max reference trajectories) 

• Specification of the prediction and control horizons. It is possible to only 
consider a subset of the prediction horizon (e.g. in the case of nonminimum 
behaviour) called coincidence points. The prediction horizon should be long 
enough with respect to the influence of the future manipulated variables’ 
variations on the outputs. The control horizon may be finite or limited to 
a single move (case of HIECON). In the case of RFC (Adersa), polynomial 
basis functions are used to parameterize the manipulated variables’ profiles. 

• Identification. Process input-output data extracted from process tests serve 
in identification. Signals such as PRBS, or even sometimes simply steps, 
are currently used. Plant testing is of primordial importance. In linear 

^ ||Aix/e||| is the euclidean norm equal to 



IX /j Su]^ 



( 16 . 3 ) 
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identification, in some packages, only one manipulated variable is allowed 
to vary while the other ones remain at their steady-state value. Some pack- 
ages (DMI), on the contrary, allow several manipulated variables to vary 
simultaneously. During the test, the PIDs are hxed; however, operators 
may intervene to avoid critical situations. 

Linear parameter estimation can be performed by two approaches: equa- 
tion error or output error. Assume a linear state-space model in the form 

^/c + l — -\- ByV]^ -\- ByjW]^ 

yfe = Cxk + ^ ' 

where Vk are measured disturbances, Wk are unmeasured disturbances or 
noises and are measurement errors. From the state-space representation, 
a transfer function matrix results 



= [I ~^y{q ^)] ^)uk+^v{q ^)vk+^m{q (16.8) 

In the output error identihcation method, the measurement error ^{k) is 
minimized by a nonlinear parameter estimation. The following ARX model 
results 



Vk = ^yiq ^)Vk + ^u{q ^)uk + ^v{q ^)vk + ^w{q ^)wk + Ck ( 16 - 9 ) 
with: Cfe = [I - ^y{q~^)^k- 

In the equation error identihcation method, d^{k) is minimized by a linear 
parameter estimation; however, the noise C/c is coloured even if is white. 

Discrete step or impulse response data are often used by MPC schemes 
to represent the process (Li et ah, 1989). This makes it easy to use and 
renders it popular, in particular, in industry. When the system is stable, 
the transfer function matrix (16.8) is approximated by the following hnite 
impulse response model (used in IDCOM, HIECON, OPC) 

yk = J2 + E + E 4 (16.10) 

i=l i=l i=l 



with 30 to 120 coefficients to describe the open-loop response. The model 
from Eq. (16.10) can be transformed into velocity form by simply replacing 
each of its variables: by its variation Ay, Au, Av, Aw, A^. 

Alternatively, the following hnite step response model can be used (DMC) 



Vk = Eti S^Auk-^ + Eti + Eti SfAwk-^ + 



with: So = 0, Si = Sn, > N and: Hi = Si — Si-i 



(16.11) 



In some cases, continuous Laplace transfer functions are obtained from the 
discrete-time models, which allows us to later make various transformations 
according to different sampling periods in discrete time. 
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All identification methods used in MPC minimize the sum of the squares 
of errors between measurements and predictions to obtain parameters 0 

Nm, 

minJ2\\yk-yff (16.12) 

k=l 

In equation error identification, the past measurements are fed back to 
the model (16.9) to obtain the estimates (one- step ahead prediction) 

yfe = ^y{q~^)yf + ^u{q~^)uk + ^v{q~^)vk + ^w{q~^)wk (16.13) 

while in output error identification, the past model estimates are fed back 
to the model (16.9) to obtain the estimates (several-steps ahead, i.e. long- 
range prediction) 

Vf, = + ^u{q~^)uk + ^v{q~^)vk + ^w{q~^)wk (16.14) 

Qin and Badgwell (1996) note that FIR models often result in overparam- 
eterization and that this problem is overcome by different means such as 
regularization and partial least squares, in nearly all industrial packages. 

• Controller tuning. After the specification of all control design features, after 
identification and obtaining of a dynamic plant model, any MPC software 
applied to a given plant needs extensive simulation off-line to test the con- 
troller performance. The regulation and tracking are tested as well as the 
respect of the constraints for controlled and manipulated variables. The ro- 
bustness must also be addressed by simulation in the case of a plant-model 
mismatch. After off-line testing, the controller is implemented on-line first 
in open-loop to verify the model predictions, then in closed-loop where its 
tuning is improved. DMC uses weights on An and y. IDCOM and HIECON 
mainly use the time constant of the reference trajectory. 

16.2 Linear Model Predictive Control 

16.2.1 In the Absence of Constraints 

In the absence of constraints, over a finite time horizon, MPC can be seen as 
GPC (Bitmead et ah, 1990; Clarke et ah, 1987a, b), which was developed using 
discrete-time transfer functions and includes a stochastic ARIMAX model. A 
continuous equivalent of the former discrete predictive control has been deve- 
loped by Demircioglu and Gawthrop (1991, 1992). 

Over an infinite time horizon, linear quadratic (LQ) and linear quadratic 
Gaussian control (LQG) are formulated in the state-space domain and are 
inherently multivariable. LQG is a powerful method that is able to handle 
large-size, nonmimimum-phase systems. 

However, the main drawback of the aforementioned control methods is that 
they do not take into account any type of constraint, including the controlled 
variables as well as the manipulated variables. 
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16.2.2 In the Presence of Constraints 

Engineers in charge of plants are confronted with constraints regarding the 
controlled and manipulated variables in particular. Thus, it is not strange that 
the hrst successful papers relating MPC dealt with important applications, such 
as in the petrochemical industry (Cutler and Ramaker, 1979; Richalet et ah, 
1978). 

16.2.3 Short Description of IDCOM 

IDCOM belongs to the class of model algorithmic control (MAC) which differs 
in particular from DMC by its characterization of the process by its impulse 
response instead of step response for DMC. In IDCOM (Richalet et ah, 1978), 
the process is identified by its impulse responses, where commonly the number 
of parameters for each impulse response would be He = 40. Thus, each output 
yj{n) of a multivariable system is a weighted sum of the past He values ek{n — i) 
of the riu inputs 

Ha 

='^'^(^k,j{i)ek{n-i) (16.15) 

fe=l i=l 

with HeTs {Tg sampling period) larger than the time response of the system. 
The identification is not performed continuously and is realized with the system 
controlled in a supervisory way, which is more favourable for the operators. 
Pseudo random binary signals are mentioned as appropriate test signals. For 
given time durations, reference trajectories are defined and the objective of 
the control algorithm is to compute the future manipulated variables so that 
the future outputs of the internal model will be as close as possible to this 
reference trajectory. Constraints are considered on the manipulated variables, 
their variations and the outputs. 

Since the first publication concerning IDCOM, there have been several im- 
provements (IDCOM-M and HIECON: HIErarchical constraint CONtrol). De- 
tails of algorithm and industrial applications are cited by Qin and Badgwell 
(1996). In particular, a multi-objective formulation with quadratic output 
objective followed by quadratic input objective when the solution to output 
optimization is not unique, a controllability supervisor deciding which outputs 
can be independently controlled, computing only one single move for each input 
which adds robustness at the expense of performance and ranking constraints 
between hard and soft, are incorporated. 

16.2.4 Dynamic Matrix Control (DMC) 

Case of a SISO System 

The algorithmic principle of MPC is summarized in Figure 16.2: at each time 
step /c, a residual r{k) between the process output y{k) and the output pre- 
diction ^*(/c) based on past inputs is calculated. Trajectories (i.e. predicted 
values over the prediction horizon) denoted by y*(/c) and r*(/c) are calculated 
respectively for ^*(/c) and r{k). Then the corrected future output trajectory 
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Figure 16 . 2 : General block diagram of Model Predictive Control 



y{k) is obtained, which is compared to the desired (or reference) trajectory 
(^k) from which the MPC control law results. 

A main drawback of DMC is that it can be only used for stable plants and 
also without integrators: this drawback comes from the fact that only the first 
M coefficients of the step response are considered for modelling the process in 
Eq. (16.16). This can be enhanced by separating the DMC algorithm into a 
predictor and an optimizer (Lunstrom et ah, 1995). This avoids, in particular, 
using too many step response coefficients as in classical DMC. 

Here, classical DMC is first discussed. In DMC (Cutler and Ramaker, 1979; 
Garcia and Morshedi, 1986), a single-input single-output system is represented 
by its truncated step response model 

M 

y{j + 1 ) = yss + E + 1 - i) + d{j + 1 ) ( 16 . 16 ) 

i=l 

where hi are the unit step response coefficients for the i-th time interval, tjss is 
the initial steady-state output {y is not a deviation variable) and d represents 
the unmodelled factors affecting the outputs. Also, Au{k) = u{k) — u(k — 1). 
M is the number of time intervals necessary to reach the steady state; it will 
be called the model horizon or truncation number (thus, hi = hM if i > M). 

In the case where the coefficients gi of the unit impulse response are used, 
the truncated impulse response model results 

M 

y{j + 1) = yss + '^giu{j + 1 - *) +d{j + 1) (16.17) 

i=l 



The unit step response coefficients hi are related to the unit impulse coefficients 
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Qi by the relations 

i 

gi = hi-hi-i and: hi ='^§j (16.18) 

i=i 

Rigorously, if M was the truncation order of the step response model, the 
truncated step response model (Garcia et ah, 1989) should be written as 

M-l 

y{j + 1) =yss+ hi Au{j + 1 - i) + /im {u{j + 1- M) - Uss) (16.19) 

i=l 

but the last term is often omitted on purpose, however for set point tracking it 
must be taken into account. In this expression, u is considered as an absolute 
variable and not a deviation variable hence the use of its initial or steady-state 
value Uss- At time /c, y{k) is known and u{k) is to be determined. 

Considering a prediction horizon Hp and the set point , the objective is 
to compute the future inputs so that the future outputs will be close to the 
set point. Thus, at time /c + /, the output prediction based on past and future 
inputs is decomposed into 

M-l 

y{k + l\k) = i/ss + ^ hi Au{k + / - i) + /im {u{k + / - M) - Uss) 



effect of past inputs 
i 

T ^ ^ hi Au(^k -\- 1 — i) T d(^k T 



i=l 



effect of predicted disturbances 



effect of future inputs 

(16.20) 

At each instant /c, only He future input changes are computed, so that 



Aix(j) = 0 V j > /c + i^c (16.21) 

Thus, beyond the control horizon He, i.e. after instant k-\-He, the manipulated 
input is assumed to be constant. 

From Eq. (16.20), define y"^{k-\-l\k) as the output prediction corresponding 
to the infiuence of the past input variations equal to 

M-l 

^*(/c + l\k) = Aix(/c + / - i) + /iM {u{k + / - M) - Uss) (16.22) 



If / > M — 1, Eq. (16.23) is simplified as 

y^{k + l\k) = yss + hM {u{k - 1) - Uss) 
and Eq. (16.20) is reduced to 



i 

y{k + l\k) = yss + hM {u{k - 1) - )+ E hi Au{k + 1 



(16.23) 



i) + d{k + l\k) 
(16.24) 
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Over a given prediction horizon and assuming M > Hc^ the vector of 
output predictions can be decomposed into 



y{k -\- l\k) = hiAu{k) 

~kyss h2Au{k — 1) + . . . + hM—iAu{k — M -\- 2) 

-\-JiM {u(^k + 1 — M) — Ugs) d{k + 1|^) 

= hiAu{k) + ^*(/c + l\k) + d{k + l\k) 

y{k-\-2\k) = hiAu{k 1) h 2 Au{k) 

~kyss h‘^Au{k — 1) + . . . + hM—iAu{k — M -\- 3) 

-{-Hm (ix(/c + 2 — M) — Uss) d(k + 2|/c) 

= h\Au{k + 1) + h 2 Au{k) + ^*(/c + 2\k) + d{k + 2\k) 

y{k + Hc\k) = hiAu{k + i^c — 1) + h 2 Au{k + i^c — 2) + . . . + hH^Au{k) 

~kyss h}jc-\-iAu{k — 1 ) + . . . + fiM—iAuik + Hq — M + 1 ) 
-\-JiM {u(^k + He — M) — Uss) d{k + He\k) 

= h\Au{k + i^c — 1) + . . . + hH,,Au{k) + ^*(/c + Hc\k) + d{k + Hc\k) 



y{k -\- He ~\~ l|^) — h 2 Au{k -\- He — 1) “h . . . “h h}j^,-\-\Au{k) 
+^*(/c + + l|/c) + d(k + i/c + l\k) 



y(k + ]V[\k) — He — 1) + . . . + Hm A ui^k) 

+^*(/c + M|/c) + d(/c + M|/c) 

y(^k + M -\- l\k) = hM—Hc^ 2 Au(^k + He — 1) + . . . + Hm A u(^k + 1) + Hm A u(^k) 
+^*(/c + M + l\k) + d{k + M + l|/c) 






resulting in 



h]\/[ Aui^k -\- He — 1) -h . . . -h h]\/[ Aui^k) 
^y*{k^Hp\k)^d{k^Hp\k) 
ifHp>He^M -1 



(16.25) 



'y{k + l\k) 




y*{k + l\k) 




Au{k) 




d{k + l|fc) 




= 




+ A 




+ 




1 

+ 




y{k + Hp\k)_ 




Au{k + He — 1) 




J{k + Hp\k)_ 



(16.26) 

where A. is the Hp x He dynamic matrix (hence dynamic matrix control) of the 
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system equal to 



" hi 


0 


.. 0 


\ 


h2 


hi 






hHa 


hHa-l • 


. . hi 


< 


hM 


hM-1 


• • hM-Hc^l 


> 


hM 


hM 


. . Hm 


< 


_ hM 


hM 


. . Hm 


> 



He rows 



(M — 1) rows 

{Hp — He — M 1) rows 

(16.27) 

Equation (16.26) thus shows the influence of future inputs. The dynamic matrix 
A. of Eq. (16.27) is exactly similar to the matrix G of Eq. (15.23) dehned in 
GPC. 

The vector of output predictions y'^{k -\-l\k) corresponding to the influence 
of the past input variations can itself be calculated (assuming Hp > M) as 



y*(k + l\k) 








hM-1 


hM-2 • 


. . /l2 




Uss 




0 


hM-1 






y*{k + M -l\k) 
y*[k + M\k) 


= 




+ 


0 


0 


. . hM-1 


_y*{k + Hp\k) 




_ Uss _ 




0 


0 


.. 0 








_0 


0 


.. 0 



u{k — M + 1) — Uss 



Au{k — M + 2) 
Au{k — 1) 






u(^k 1 ) Ugg 

u{k 1) Ugg 



u{k 1) ^ss 



(16.28) 



The combination of Eqs. (16.16) for j = k — 1 and (16.20) for / = 0 gives 
the influence of the unmodelled effects 



d{k) = y{k) - i/*(/c|/c) 



(16.29) 



Consequently, based on a measured output y^(k)^ an estimation of d(k) is 
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given as 

d{k ^l\k) = d{k\k) = y^{k) -y*{k\k) = 

M-l 

= y"^{k) - [yss + ^ hiAu{k - i) + hM{u{k - M) - «ss)] 

i=l 

(16.30) 

thus the predicted disturbances are all equal to the present estimated distur- 
bance. 

Let us define a quadratic criterion, taking into account the difference be- 
tween the estimated output and the reference over the time horizon as 

Hp 

J = y^^{y{k + i\k) — y^^^{k + i))‘^ (16.31) 

i=l 

From the previous equations, this can be seen as computing the vector of future 
input variations 



Au{k) = [Au{k ) . . . Au{k He — 1)]^ (16.32) 

which is the least-squares solution of the following linear system resulting from 
Eq. (16.26) 

r y^^^ {k -\- 1) — y* {k -\- l\k) — d{k\k) = e{k -\- 1) 1 



+ Hp) -y*{k + Hp\k)~ d{k\k) = e{k + Hp) 



e(/c+l) = A.Au{k) 
(16.33) 



The least-squares solution of (16.33) is 

Au{k) = {A^A)-^A^ e{k + 1) (16.34) 



Only the first input variation of vector (16.32) equal to (16.34) is implemented. 
According to Garcia and Morshedi (1986), the choice of the time horizon Hp 
as Hp = He + M generally results in a stable controller. Garcia et al. (1989) 
simply state that for sufficiently small He and sufficiently large Hp such that 
Hp ^ He ~\~ JH — 1, the closed-loop system is stable. The control law (16.34) 
provides a too strong control action and will be improved in the following by 
introduction of weighting terms as in (16.45). However, Li et al. (1989) assume 
M > Hp and Gamacho and Bordons (1998) choose M ^ Hp. Soeterboek 
(1992) shows that the prediction horizon must be larger when constraints are 
present. 

Case of a MIMO system 

Similarly, a multivariable system {uu inputs, Uy outputs) is represented as 

M 

y{k + 1) = yo + L] ^u{k - i + 1) + d{k + 1) 

i=l 



(16.35) 




Process Control 



587 



where ai is an Uy x Uu matrix of unit step response coefficients for the i-th 
time interval, jjq is the initial output vector and d represents the unmodelled 
factors affecting the outputs. 

Any input-output pair i-j can be represented by a matrix A^j of coefficients 
h perfectly similar to Eq. (16.27) so that the complete system is finally repre- 
sented by a multivariable dynamic matrix composed of elementary matrices 
Aij of type (16.27) as 



All • • • 

• • • Ajlyflu 



(16.36) 



For a 2 X 2 system, the representation of the system equivalent to Eq. (16.26) 
would be 



'yi{k + l\k) 




■ yl(k + l\k) 




di{k + l\k) 




= 




+ 




_yi{k + Hp\k)_ 




_ yt{k + Hp\k) _ 




. di{k + Hp\k) . 





Aui{k) 




Au2(k) 


-\-Aii 




+ Ai2 






Aui{k + He — 1) 




_Au2{k + i7c - 1) _ 



■ mik + l\k) 




'y^{k + l\k) 




d2{k + l\k) 




= 




+ 




_ mik + Hp\k) 




y^{k + Hp\k) _ 




. d2{k + Hp\k) . 





Aui{k) 




Au2{k) 


-\-A 2 i 




+ A 22 






Aui{k + He — 1) 




_Au2{k + i7c - 1) _ 



or, finally. 



(16.37) 
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■ yl{k + l\k) 
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yt{k + Hp\k) 


1 


di{k + Hp\k) 


mik + l\k) 




yl{k + l\k) 




d2{k + l\k) 


_ mik + Hp\k) _ 




_ yl{k + Hp\k) _ 




_ d2{k + Hp\k) _ 



+ 



All Ai2 
A21 A22 



Aixi(/c) 

Aixi(/c + He — 1) 

Au2{k) 

Au2{k ^ He - 1) 



(16.38) 
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Define the vector of future input variations similarly to (16.32) 

Au{k) = [Ani(/c)^ ... (16.39) 

and also the vector of deviations similarly to (16.33) 

e{k + 1) = [ei{k + 1)^ ... Bn.^{k + 1)^]^ (16.40) 

The least-squares solution of the multivariable DMC controller is also given 
by (16.34). Again, Garcia and Morshedi (1986) recommend the choice Hp = 
Hc^M for obtaining a stable controller. A large value of the prediction horizon 
Hp improves stability even if it does not significantly improve performance 
(Shridar and Cooper, 1998). The control horizon He should be taken to be 
greater than 1. 

Some input variations can be suppressed by formulating multivariable DMC 
as 

] = [ A ] 

where A is a diagonal matrix equal to 

A = diag( Ai .^. Ay A2 . . . A2 A^^ . . . A^^) (16.42) 

Ha values 

It is also possible to selectively weight the controlled variables by multiplying 
the equations such as (16.33). The matrix of weights is thus 

r = diag( 7 i . . . 71^ 72 ... 72 7n„ ■ ■ ■ 7« J (16.43) 

Hp values 

The matrix T defined in (16.43) for selective weighting of controlled varia- 
bles and the matrix A defined in (16.42) for suppression of some input variations 
are now incorporated in the criterion. The following quadratic criterion to be 
minimized with respect to Au(k) results 

j=\ [yik) - y^'^f{k)]^vH [y{k) - y^'^fik)] + {k)^J K^u{k) 

= 1 [A.ilu(k) — e(k + l)]^r^r [A.ilu(k) — e(k + 1)] + - AH' {k)A^ AAu{k) 

^ ^ (16.44) 

In the absence of constraints, the solution of (16.44) is 

Am(A;) = (.4^r^r.4 + A^a) A^HTe{k + l) (16.45) 

Again, the control law of DMC in the absence of constraints given by Eq. 
(16.45) is similar to the control law of GPC given by Eq. (15.26). 

A typical industrial study concerning application of DMC to a model IV 
fiuid catalytic cracker is related by Gusciora et al. (1992) who describe the linear 
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matrix model used for control with 11 inputs and 9 outputs, the DMC block 
diagram, the main problems encountered and the benehts generated . Shridar 
and Cooper (1998) propose a tuning strategy for unconstrained multivariable 
DMC. They note that the prediction horizon Hp should be in accordance with 
the settling time of the process. Also, increasing the input horizon He from 
2 to 6 does not modify greatly the closed-loop performance, but it should be 
greater than or equal to the number of unstable modes of the system (Rawlings 
and Muske, 1993). Al-Ghazzawi et al. (2001) present an on-line tuning strat- 
egy based on the use of sensitivity functions for the closed-loop response with 
respect to the MPC tuning parameters. 



16.2.5 Quadratic Dynamic Matrix Control (QDMC) 

The handling of constraints was not completely satisfactory in the original 
DMC which motivated Garcia and Morshedi (1986) to develop a quadratic 
programming solution to the DMC problem. Different types of constraints 
(soft, which can be violated, and hard constraints) are commonly encountered: 

— Constraints affecting the manipulated variables such as valve saturations 

'^min ^ ^ '^^max (16.46) 

— Constraints affecting the controlled variables: e.g. avoid overshoots. 

— Constraints affecting other variables which must be kept within bounds. 

— Constraints added to the process in order to avoid inverse responses result- 
ing in nonminimum-phase behaviour. 

— Terminal state constraints. 

All these constraints can be resumed as a system of linear inequalities incorpo- 
rating the dynamic information concerning the projection of constraints 



B Au{k) < c{k + 1) 



(16.47) 



where B contains dynamic information on the constraints and c(k + 1) con- 
tains projected deviations of constrained variables and their bounds. It has 
been stated in the literature (Camacho and Bordons, 1998; Garcia and Mor- 
shedi, 1986; Maciejowski, 2002; Soeterboek, 1992) how to represent the afore- 
mentioned cases of constraints; note that Au{k) also contains the prediction of 
the future input changes. 

In the presence of constraints (16.46) and (16.47), the problem can be thus 
formulated as quadratic programming, such as 



min^ ^Au{k)'^ HAu{k) — g{k -\- Au{k) 



(16.48) 



subject to constraints (16.46) and (16.47). H is the Hessian matrix (in general 
fixed) equal to 



H = .4^r^r.4 + A^A 



(16.49) 
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and gf(/c + 1) is the gradient vector equal to 

g{k + 1) = A^V^Ve{k + 1) (16.50) 

This quadratic problem can be solved efficiently by available numerical sub- 
routines based, for example, on Rosen’s method (Soeterboek, 1992), conjugate 
gradients or a quasi-Newton method (Camacho and Bordons, 1998; Fletcher, 
1991). 

In QDMC, the choice of the projection interval to be constrained is impor- 
tant, as not all the horizon Hp necessarily needs to be constrained. For example, 
for a nonminimum-phase system (dead time or inverse response), shifting the 
constraint window towards the end of the horizon is favourable. 

Another version of DMC, called LDMC, where the criterion concerns the 
sum of the absolute values of the errors has been developed (Morshedi et ah, 
1985). In this case, the optimization problem is solved by linear programming. 

In some cases, stability due to output constraints can be found with QDMC 
(Muske and Rawlings, 1993) for open-loop unstable systems. 



16.2.6 State-Space Formulation of Dynamic Matrix 
Control 

Assume that the system is described by a state-space discrete-time model of 
the form 

x(k + 1) = Ax(k) + Bu(k) 

y{k) = Cx{k) ^ ' 

Assuming zero-initial conditions (x, u and y are deviation variables), the equiv- 
alent discrete-time transfer matrix is 






(16.52) 



Assuming that A is stable, G{z) can be expanded as 

oo oo 

G{z) ='^§iZ~^ z:=^Y{z) ='^giZ~^U{z) (16.53) 

i=l i=l 

where gi are coefficients of the impulse response, decreasing exponentially. For 
this reason, in the time domain, the impulse response is generally truncated at 
order n as 

n 

y{k) = N “ *) (16.54) 

i=l 

If the input u is chosen as a unit step, it results 



k 

y(0) = 0 , y(l) = gi , y{2) = gi + §2 , ■ ■ ■ , y{k) ='^Si 

i=l 



(16.55) 
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Define 



k 

hk = '^ 9 i y{k) = hk (16.56) 

i=l 



The hk are the step response coefficients. The following relation can also be 
given 

Si = hi- hi-i (16.57) 



Equation (16.54) can be expressed with respect to the step response coefficients 
as 



n— 1 

y{k) = '^^hiAu{k—i)-\-hnu{k—n) with: Au{k) = u{k)—u{k — l) (16.58) 

i=l 

Thus, any state-space discrete-time system can be transformed into a truncated 
step response model which can be later used in the DMC or QDMC formulation 
(Garcia et al., 1989). However, this transformation remains artificial and does 
not constitute a real state-space method. 



16.2.7 State-Space Linear Model Predictive Control as 
OBMPC 

The state-space approach is appealing, as it allows us to avoid problems due 
to the truncation of the step response in the classical DMC approach. Li et al. 
(1989) proposed first state-space formulation for MFC. State-space linear MFC 
was developed in a similar spirit (Lee et al., 1994; Lunstrom et al., 1995; Ricker, 
1990a). In these papers, two stages are used for the model prediction: first, a 
predictor of the output is built, then a state observer is used to estimate the 
states. The observer-based model predictive control (OBMFC) developed by 
Lee et al. (1994), taken again by Lunstrom et al. (1995), is discussed in the 
following. 

The original idea (Li et al., 1989) is to represent the entire trajectory for a 
SISO system as a sequence of states 



xi{k) = X 2 {k — 1) + /ii Au{k — 1) 



Xi{k) = XiJ^i{k — l)^hi Au{k — 1) i = 1, M 



(16.59) 



where hi are the step response coefficients, and to use these equations with 
respect to the predicted output corresponding to the infiuence of past variations 



^*(/c + i — l\k) = ^*(/c + i — l\k — 1) -\- hi Au{k — 1) (16.60) 



Note that the infiuence of the input disappears for a stable process when i > M 
as hi = hss (steady-state). 

In the case of a MIMO system with riu inputs and Uy outputs, the matrix 
Si is defined at each instant i to represent each coefficient of the step responses 
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as 



^ 2 , 1 ,^ h2^2,i 



h\^nu,i 

h2,riu,i 






^ny,2,i 



(16.61) 



where hk^i^i is the step coefficient at instant i of output k with step input 1. 

It is assumed that at time /c, the inputs to be determined are u{k) and the 
future control actions. 

In the absence of disturbances, the state-space form corresponding to the 
step response model can then be written as 



Y{k) =^Y{k-l)^S Au{k - 1) 
y^{k\k) =^Y{k) 



(16.62) 



This original state-space model (16.62) can be extended (Lunstrom et ah, 
1995) in a similar way to the LQG state-space model to include disturbances 
w and output noise v as 



Y{k) = ^ Y{k - 1) ^SAu{k - 1) ^TAw{k - 1) 

y*{k\k) =^Y{k) (16.63) 

y{k) = y*{k\k) + v{k) 



with 

Y{k) = [y*(/c|/c)^ y*(/c + l\k)^ . . .y*(/c + M - l\k)^ Xu{k)^ x^u{k)^]^ 

y*{k+i\k) = [yl{k + i\k) . . - yl^{k + i\k)]’^ 

Au{k) = [Aui{k ) . . . Aun^{k)]'^ 

(16.64) 

the matrix ^ of size {M.riy + dima:^-^^ + dima:^^^) 



0 In 0 ... 0 0 0 

fly 

0 0 In ‘ . 0 : : 

fly 



0 0 
0 0 
0 0 



In 0 0 

^riy 

0 Au 0 

0 0 Ayy 



the matrix S equal to 



Si 



the matrix ^ equal to 



S = 



Sm 

Bu 

0 



^=[1. 



(16.65) 



(16.66) 



(16.67) 
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and the matrix T equal to 



T= [ 0 0 






(16.68) 



Residual plant dynamics and disturbance dynamics have been respectively 
incorporated in (16.63) in state-space form as 



^it(^ — -^u ^it(^) Bu zXrx(/c) 

— -^w ^ci(^) Buj zX'm(/c) 



(16.69) 



Moreover, the deviation between the last and previous predictions considers 
the additional residual plant and disturbance dynamics as 

y^{k ^ M - l\k) = y^k ^ M - l\k - 1) ^ Sm An(/c) + Cu Xu{k) + x^{k) 

(16.70) 

The future outputs are predicted by using a state observer such as the 
optimal linear Kalman hlter of matrix gain K. In a general way, the predictions 
of the future outputs would be given by the two- stage form: hrst, the model 
prediction 

Y{k + l|/c) = ^ Y{k\k) ^SAu(k) (16.71) 

then, the correction based on measurements 



with 



Y{k\k) = Y{k\k - 1) + K [y{k) - y*{k\k - 1)] (16.72) 

y*(/c + M- l|/c)^ 



Y {k\k — 1) = [y*(/c|/c — 1)^ y*(/c + l|/c — 1)^ 

Xu Xuj] 

y*{k\k-l) = <i/Y{k\k) 

(16.73) 

Now that the model states are estimated, the optimization can be performed 
in a similar way to QDMC (Garcia and Morshedi, 1986). The objective function 
to be minimized with respect to ALi(k) is 

j = II r [y{k + i\k)~ n{k + i\k)] f + \\AAu{k\k) f 

with 

AU{k\k) = [ An(/c|/c)^ Au{k + l|/c)^ . . . Au{k He — ^k)"^ 
y{k ^l\k) = Y{k\k) + Sh, AU{k\k) 

= [yf{k + l\kY ■■■ yf{k + Hp\kYY 
n(k + l\k) = [r{k + l|fc)^ ...r(k + Hp\k)'^Y' 

where TZ is the reference trajectory. Note that the He future input moves are 
used in y{k + l\k) of components y^. The matrix Sh,, is 



(16.74) 



(16.75) 



^H„ = 



Si 


0 0 


.. 0 


S2 


Si 0 . 


.. 0 


Sffc 


S>Hc-l ■■■ 


Si 


Sffp 


Sh,,-i ■ ■ ■ 


SHp-Hc + 1 



(16.76) 
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and is 

= [ lup.Hp 0 ] $ (16.77) 

In the absence of constraints, the least-squares solution of OBMPC ex- 
pressed by criterion (16.74) is 

AU{k\k) = [5y A]-1 S^Hp [7^(fc + l\k)-^Hp Y{k\k)] 

(16.78) 

Only Au{k\k), first component of AU{k\k), is implemented. 

In Lunstrbm et al. (1995), the residual dynamics represented by {Au, 

Cu) are approximated as Uy x riu single-input single-output first-order systems, 
thus linking each input to each output. These authors also assume classically 
that the measurement noise v is uncorrelated white noise. Moreover, the distur- 
bances d are assumed to be integrated white noise filtered through first-order 
dynamics. This allows us to obtain a very simplified Kalman filter, which avoids 
solving a large Riccati equation (Lee et ah, 1994; Lunstrbm et ah, 1995) 

16.2.8 State-Space Linear Model Predictive Control as 
General Optimization 

A more general but less formalized optimization approach can be adopted. 
Consider the system described by a state-space discrete-time model in the form 

x(k + 1) = Ax(k) + Bu(k) . x 

y{k) =Cx{k) ^ ^ ^ 

and the constraints written as 



Ex + Fu < T (16.80) 

The control problem consists of solving the open-loop optimization (Morari and 
Lee, 1999) 



Hp-l Ha-l 

min J = x{Hp)'^ Sox{Hp) x{i)'^ Qx{i) Ru{i) (16.81) 

^=0 ^=0 

subjected to the constraints (16.80). Hp is the prediction or output horizon. 
He is the control or input horizon. 

In the case where both prediction and control horizons are infinite and where 
no constraints are present, the linear MFC becomes the classical discrete-time 
LQ problem. 

When the prediction horizon is finite, in the framework of MFC, this is 
referred to as a receding horizon control problem, as only the first control 
IX* (0) of the optimal sequence ix*(x), ^ = 1, He — 1, is implemented. In the case 
where both control horizons are finite, this is a classical optimization problem, 
for which available numerical subroutines exist. However, contrary to the case 
of the LQ control whose static state feedback control law (through the solving 




Process Control 



595 



of an algebraic Riccati equation) guarantees closed-loop stability, this problem 
is not obvious for linear MPC and several related questions are addressed by 
Morari and Lee (1999): 

— It is possible that the constraints (16.80) render the optimization problem 
infeasible. 

— As the optimization problem is solved in open-loop, it may occur that the 
closed-loop system goes out of the feasible region. In commercial algorithms 
(Qin and Badgwell, 1996), soft constraints exist which can be violated during 
some time, contrary to hard constraints. They are penalized in the objective 
function. 

— In the case of an unstable system, in general the system cannot be stabilized 
globally (some states are not stabilized) when there are input saturation 
constraints. 

This absence of stability question of the hnite horizon model predictive con- 
troller was solved by using a penalty on an inhnite horizon, although their num- 
ber of decision variables remained hnite (Muske and Rawlings, 1993; Rawlings 
and Muske, 1993) . 

Even if the most encountered criterion to be minimized in the case of MPC 
is a quadratic one such as (16.31), (16.44) or (16.74), other possibilities exist. 
For example, Ricker (1990b) dehnes a linear objective function subjected to 
constraints, thus leading to a linear programming problem for which general- 
purpose packages exist. This linear objective function takes the form 

J = wj \y^{k) - y{k) \ +w'^y + \Au{k) \ +wlu (16.82) 

where w are weights. In this linear objective case, stability and robustness 
issues can only be dealt with by simulation. 



16.3 Nonlinear Model Predictive Control 

16.3.1 Nonlinear Quadratic Dynamic Matrix Control 
(NLQDMC) 

Given a nonlinear state-space model, it is tempting to linearize it around a 
variable operating point in order to further apply linear MPC. This is the 
simplest nonlinear approach, which is useful when the process to be controlled 
presents strong nonlinearities. It was proposed by Gattu and Zahriou (1992, 
1995) who adapted the nonlinear version of QDMC proposed by Garcia (1984) 
by incorporating a state estimator. Their controller can cope with unstable 
and integrating processes. The algorithm can be described as follows: 

— Variables known at sampling instant k: measurement y{k), estimation 
x{k\k — 1) of state at time k based on information available at time /c — 1, 
manipulated variable u{k — 1), sampling period Tg. 

— Objective: compute the He future input variations from which only the 
hrst one will be implemented. 
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Step 1: Linearization of the continuous nonlinear model 

x{t) = f{x{t), u{t)) , y{t) = h{x{t), u{t)) 

at [x{k), u(k — 1)] to get the continuous linearized system 

x{k\k — 1) = Ak x{k\k — 1) ^ Bk u{k — 1) 
y(k) = Ck x{k\k -1) ^ Dk u{k - 1) 



with 



Ah = 



dx 

dh 

k — 

OX 



Bk = 



x,u 



x,u 



du 
dh 

Dk — t; — 

du 



x,u 



x,u 



— Step 2: Discretization of the linearized model (16.84) as 

Xj _|_ 1 — 4“ r /j; Uj 

Vj = Ck Xj + Dk Uj 



(16.83) 



(16.84) 



(16.85) 



(16.86) 



— Step 3: Computation of the step response coefficient matrices Si^k as 



Si,k = Y.Ck^i-^Vk (i = l,...,Fp) (16.87) 

J = 1 

or obtaining of these step response coefficient matrices by integration of the 
nonlinear system (16.83) over Hp sampling periods with initial condition 
X = 0 and u = 1. The predicted output variation due to the future input 
moves is 

i 

Ay{k + l\k — 1) = Si^kAu{k -\- 1 — i) (/ = !,..., Hp) (16.88) 

i=l 

with Au{k) = u{k) — u{k — 1). 

— Step 4: Computation of the gain Kk of the discrete Kalman linear filter 
(Sect. 11.1.2). 

— Step 5: Prediction of the influence of past input variations. Define 

y*(/c + /|/c — 1) {I = 1, . . . , Hp) as the future output prediction assuming that 
all future inputs are constant and equal to u{k — l). This step is decomposed 
into four stages. 

Stage 5-1: The predicted states x"^{k\k — 1) are taken to be equal to the 
estimates x{k\k — 1). 

Stage 5-2: The disturbances at time k are d{k\k) = y{k) — h{x{k\k — 1)) 
where y{k) is the measurement vector. 

Stage 5-3: The future disturbances are considered to be equal to the present 
ones d{k + l\k) = d{k\k) (/ = !,..., Hp). 

Stage 5-4: Integrate sequentially the nonlinear state-space system x = 
f{x^u) with initial condition x"^{k I — l\k — 1)^ {I = l,...,iLp) and 
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u = u{k — l) over one sampling period, then correct the result of integration 
by adding K^d to obtain the predicted state vector {k l\k — 1) . Deduce 
the predicted output based on the past input variations y*(/c + / |/c — 1) = 
h{x^{k^l\k-l)) 

— Step 6: Computation of the predicted output considering the influence of 
past and future input moves and estimated disturbances 

i 

y{k+l\k-l) =y*{k+l\k-l)+J2Si,kAu{k+l-i)+d{k\k) , l = l,...,Hp 

i=l 

(16.89) 

— Step 7: Optimization by minimization of the quadratic criterion, that is 
completely similar to (16.74), taking into account the predicted outputs y 
on the prediction horizon from k 1 to k Hp^ while the input moves 
An are considered on the control horizon from k to k He. The criterion 
is minimized with respect to these input variations. Gattu and Zahriou 
(1992) note that this optimization problem can be formulated as a standard 
quadratic programming one. 

— Step 8: Implementation of the hrst input move Au{k). 

— Step 9: Update of the state vector by integration over one sampling pe- 

riod of the nonlinear state-space system x = f{x,u), with initial condition 
x"^{k\k — 1) and just calculated u = n(/c), then correct the result of integra- 
tion by adding Kkd to get x"^{k l\k). 

(Gattu and Zahriou, 1992) remark that this approach does not reject perfectly 
the step disturbances, but that the closed-loop system is stable in that case. 



16.3.2 Other Approaches of Nonlinear Model Predictive 
Control 

Different approaches are used in nonlinear MPG (NMPG). However, this re- 
mains a domain in development where much research is actually being carried 
out (Allgower and Zheng, 2000). The type of nonlinearity of the model can 
vary greatly even including second-order Volterra models (Doyle et ah, 1995; 
Genceli and Nikolaou, 1995; Maner et ah, 1996) or neural networks. In general, 
state-space models are required. Qin and Badgwell (2000) give an overview of 
NMPG applications. Interesting reviews about NMPG (Allgower et ah, 1999; 
Bequette, 1991; Ghen, 1997; Mayne et ah, 2000; Morari and Lee, 1999) have 
been published, where they insist on two main obstacles concerning the exten- 
sion of MPG from linear to nonlinear systems: 

• the stability issue for constrained hnite horizon systems. Different types 
of constraints may be introduced to guarantee stability for linear MPG. 
Similar approaches are often used for nonlinear MPG. Also an inhnite or 
quasi-inhnite horizon is used. 

• the computational burden: a nonlinear optimization problem must be solved 
on-line and there is generally no guaranty of Ending a global optimum. 
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Given the following nonlinear state-space model 

x{t) = f{x(t),u(t)) , x(0) = Xo 

subject to input and state constraints (given in the simplest form) 

X(t) e X , with: X = {X e W^\Xmin <X< Xmax} 

u{t) e U , with: W = {m G K”"j <u< Umax} 

for stability studies, the NMPC problem is often formulated as a finite horizon 
open-loop optimal control problem (Chen and Allgower, 1998a,b) (see Chap. 
14) as 



(16.90) 



(16.91) 



rt-\-Tp 



mm 

ii 



J{x,u,Tp) = J^ {\\^T;x{t),t)\\Q + \\u{T)\\%)dT (16.92) 



subject to 
and 



x{t) = f{x^ii) , x{t; x^t) = x{t) 



(16.93) 

(16.94) 



x{r; x{t)^ t) ^ X 

u{j) G U 

\\x\\q = x^Qx is the Euclidean norm weighted by the positive definite matrix 
Q (same for R). Tp is the prediction horizon. The control horizon Tc not 
mentioned here is such that Tc < Tp. u and x are the predicted variables by 
the controller differing from the actual values u and x on the plant. 

For normal use, the functional F under the integral would be formulated as 



F{x, u) = {x — x^)"^ Q (x — x^) + (n — n’^)^ R{u — u^) 



r\T 



(16.95) 



where x'^ and are reference trajectories. However, for the stability study, it 
suffices to consider {x^^u^) = (0,0) as the steady-state point. 

Because of the finite horizon in (16.92), closed-loop stability cannot be 
guaranteed in general, although it can be obtained by appropriate parameter 
tuning of an MPC scheme. It might be possible to use an infinite horizon as in 
LQG control to guarantee closed-loop stability; however, this would result in 
an infinite or high-dimension optimization problem, which is not desirable. For 
that reason, researchers have suggested different solutions to solve the closed- 
loop stability problem. 

Mayne and Michalska (1990) introduced a terminal equality constraint 

x{tFTp)=^ (16.96) 

which forces the state to zero at the end of the prediction horizon. The optimal 
objective function can then be considered as a Lyapunov function. The same 
authors (Michalska and Mayne, 1993) relaxed this condition by transforming 
it into a terminal inequality constraint 



x{t T Tp^ G 



(16.97) 
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where is a terminal region which is a region of attraction for the nonlinear 
system controlled locally by a linear state feedback law: u = Kx. Thus, 
a dual-mode controller is proposed as a receding horizon controller: outside 
the terminal region, a receding horizon controller is applied and inside the 
terminal region, the linear state feedback law, so that switching between the 
two controllers must be performed. To be feasible, the state at the end of the 
hnite horizon must be on the boundary of the terminal region. The following 
optimization problem must then be solved 

min J{x, n, Tp) (16.98) 

ii,Tp 

where Tp (equal to Tc) becomes a minimization variable. 

Chen and Allgower (1998a,c) added a terminal penalty term similarly to 
Michalska and Mayne (1993) to the hnite horizon objective function which 
becomes 



rt-\-Tp 

mmJ{x,u,Tp) = \\x{t+Tp; x{t),t)\\p+ / {\\x{t; x{t),t)\\Q + \\u{t)\\p ) dr 

u Jt 

(16.99) 

to obtain the quasi-inhnite NMPC which approximates to inhnite horizon pre- 
diction. The constraints are now 



x{t) = , x{t, x^t) = x{t) 

x\r]x(t),t) ^ X , rG[t,t + Tp] 
u{t) , r G [t, t + Tp] 

x{t XTp] x(t), t) G Q 



(16.100) 



When the quasi-inhnite horizon objective function is written in more general 
terms than the quadratic terms used in (16.99), hnding the terminal region is, in 
general, very difficult. In the case of the quadratic expression (16.99) and when 
the system admits a linearizable part that is controllable, Chen and Allgower 
(1998b) proposed the following off-line procedure (very similar to Michalska 
and Mayne (1993)) in order to compute the terminal region: 

Step 1: Calculate the Jacobian linearization (A, B) of (16.90), then determine 
the locally stabilizing linear state feedback: u = K x 
Step 2: choose a positive constant a < —\max{^K) and solve the Lyapunov 
equation 

{Ak + alf P + P {Ak + al) = -{Q + RK) (16.101) 



to obtain the positive dehnite matrix P, with Ak = AT B K. 

Step 3: Find the largest [3i dehning the region such that the state and 
input constraints are satished for x G ft i 



G I x^Px < Pi} 

and Qi C X and Kx Va:^ G 



(16.102) 
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Step 4: Find the largest (3 c]0,/3i] specifying a terminal region Q 

^4 = G I x^Px < (3} (16.103) 

such that the optimal state solution of the following optimization problem 
is non-positive 



nmx {x"^P(l){x) — ax^ Px \ x^ Px < /3 } (16.104) 

with: (j){x) = f(x, Kx) — Akx. 

Step 5: Choose the prediction horizon satisfying 

Tp>T,ATs (16.105) 

where Tg is maximum time needed for the uncontrolled system to reach ft 
starting from xq. At most, Tg could be chosen as the settling time. 

The approaches that have been previously described share the objective of 
satisfying closed-loop stability and possible real-time implementation. Other 
approaches closer to optimization are possible especially, for large-scale systems 
(Biegler, 2000; Doyle et ah, 1997; Van Antwerp and Braatz, 2000). For example, 
a direct multiple shooting method that is applied on-line to a large scale system, 
i.e. a high-purity distillation column has been proposed (Bock et ah, 2000; Diehl 
et ah, 2001). 



16.4 Model Predictive Control of a FCC 

16.4.1 FCC Modelling 

Description of the Fluid Catalytic Cracking Process 

The fluid catalytic cracking (FCC) unit contains three main parts: the reactor 
riser, the particle separator and the catalyst regenerator. The catalytic cracking 
of petroleum fractions provides a large range of products including gasoline, 
light olefins, gas oil, paraffins, naphthenes, aromatics and olefinic polymers. 
The FCC process has been used with many evolutions for more than half a 
century (Avidan and Shinnar, 1990). 

The riser is considered as a plug flow reactor. The space time for the riser 
is a few seconds. Due to coking in the riser, the catalyst loses its activity in a 
few seconds. The cracking reaction is endothermic. 

After the riser, a larger vessel, called the separator, is situated which has 
two roles: this vessel hosts the riser cyclones and stripping steam enters this 
particle separator so that the catalyst is separated from the gaseous products. 
The separator is considered as a CSTR. 

The regenerator is designed to regenerate the catalyst by combustion of the 
coke. It is a fluidized bed composed of two zones: the dense zone, where most 
of the catalyst is present, and the dilute zone, where practically no catalyst is 
present. Some authors consider that the dense zone is divided into two phases: 
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Gaseous Products 




the emulsion phase, which acts as a CSTR, and the bubble phase, which acts as 
a plug flow. A good description of the behaviour of the regenerator considered 
as a fluidized bed can be found in literature (Kunii and Levenspiel, 1991). 

The importance of FCC control is emphasized by many authors (Arbel 
et ah, 1996; Grosdidier et ah, 1993). The process dynamics are complex. The 
FCC optimization is primordial for the rehnery economics. The choice of the 
control structure is important. Moreover, operating constraints and the nonli- 
near behaviour of the process make the process control problem very attractive 
for performing multivariable algorithms such as MPC ones. 

A Short Review of FCC Models 

Most of the FCC literature studies are based on empirical or semi-empirical 
models, which usually conform quite well with the real industrial process in a 
small operating range. However, when the operating conditions change, their 
validity can fail. So these types of models are unreliable. Many models do 
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not describe important variable variations such as pressure effects (Balchen 
et ah, 1992; Hovd and Skogestad, 1993), use oversimplified kinetics or very ap- 
proximate models (Ansari and Tade, 1997; Christofides and Daoutidis, 1997; 
Khandalekar and Riggs, 1995). In most published papers, the greatest differ- 
ences deal with the modelling of the dense bed in the regenerator. Although 
the bubbling bed model (Kunii and Levenspiel, 1991) is often referred to, due 
to the complexities of the proposed approaches and some lack of information, 
the degree of refinement of the regenerator model very much differs according 
to the authors (Arbel et ah, 1995a). Authors even disagree on the necessity of 
taking into account the spatial character for the bubble phase in the dense bed 
(Lasa et ah, 1981) although this is more realistic and considered by more de- 
tailed papers (McFarlane et ah, 1993). The freeboard over the dense bed is not 
often considered in the modelling. Of course, these differences concerning the 
FCC models have strong implications on the frequently related instability is- 
sues (Arandes and de Lasa, 1992; Arbel et ah, 1995b; Elnashaie and Elshishini, 
1993; Elshishini and Elnashaie, 1990). On the other hand, a full model such as 
the one proposed by Han and Chung (2001a, b) probably incorporates too much 
complexity for the objective of control itself. In particular, partial differential 
equations are not always necessary when time scales have largely different or- 
ders. Thus, the question of an adequately reduced model is crucial and gives 
rise to different models with regard to the pursued objective. 

A good compromise is the reliable model of moderate complexity which has 
been proposed by Rohani and co-workers (Ali et ah, 1997; Ali, 1995; Malay, 
1998). In their studies, a four-lump reaction scheme (Lee et ah, 1989) is used for 
the riser reactions, the bubbling bed Kunii-Levenspiel for the regenerator with 
ordinary differential equations for the dense phase and spatial effects for the 
gaseous species in the bubble phase; the pressure effects have been incorporated. 



Description of the Considered FCC Model 

The model is adapted from those used by Balchen et al. (1992) and Hovd and 
Skogestad (1993). In their study, the spent and regenerated catalyst fiow rates 
are equal although they are not in real FCCs. Pressure effects are not taken 
into account. Their riser model is inspired from Lee and Groves (1985) who 
used a three-lump kinetics. Their regenerator model comes from Errazu et al. 
(1979). In the present model, only the case of partial combustion of coke in the 
regenerator is considered. Typical constraints for FCC variables are commented 
in the literature Monge and Georgakis (1987). 

Riser: 

Balchen et al. (1992) avoided the use of spatial derivatives by introducing in- 
termediate variables at a specified height in the riser. We have checked that 
this allows to obtain very similar results to the classical case where spatial 
derivatives are used. However, the latter usage is more immediate and more 
common so it was preferred. 

The feed temperature Tris{z = 0) at the riser inlet is derived from an energy 
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balance at the inlet 

Peat Pp,cat {Treg Pris (0)) = Ffeed [Cp,ol {Tboil - Tfeed)F 



^Hvap + Fp,og {Tris{^) ~ Tboil)] 



(16.106) 



resulting in 






Feat Fp,cat Trpeg Ff^ed {Fp^ol {Tboil - Tfeed) + ^Hvap Fp,og Tboil) 



Feat Fp,cat F Ffeed Fp,og 

(16.107) 

In the riser, only two components are considered: gas oil and gasoline. For a 
more complex lumped model, similar equations would be obtained. The kinetic 
constants follow Arrhenius law, for gas oil consumption 



ki = kio exp{-Eaf/{RTris)) 
and for gasoline production 

h = ho exp{-Eag/{RTris)) 
The catalyst undergoes a deactivation along the riser 



(16.108) 

(16.109) 



(^ = (1 - m Ceoke,reg) 6Xp(-a te Z Cowr) with: Cowr = Featj Ffeed 

(16.110) 

Two mass balances give the following respective spatial equations for gas oil 
and gasoline weight fractions 



4 



= -h Vgo 



^Vgo _ 7 ^ _2 

^ = (Q!2 ki ygo - ks Vg) Co 



(16.111) 



while the energy balance gives the temperature variation along the riser 
dTrpig /S.Hrp erack Ffeed dy. 



ygo 



dz 



Feat Fp, cat F FfeedFp,ol F ^ F feed Fp, steam dz 



(16.112) 



The temperature at the exit of the riser is denoted by Tris{l)- The concen- 
tration of coke produced is 



F coke, prod — W A GXp I 



E, 



acf 



RTr^s{l) 

SO that the concentration of coke leaving the riser is 

Fcoke,ris{h ~ F coke,reg F Ccoke,prod 



(16.113) 



(16.114) 



Separator 

A separator follows the riser simply with a mass balance for the coke on the 
catalyst 

dC eoke,sep Feat {Fcoke,ris{h F coke,sep) 



dt 



‘^cat^sep 



(16.115) 
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and an energy balance giving the temperature variation 



(16.116) 



Note that ^^^^(l) — Tg^p and Cc,ris{^^ — ^coke,sep' 

Regenerator 

Complex models describing the regenerator behaviour in detail are available 
in the literature. Here, a very simplified model is used. The model assumes 
partial combustion of coke, i.e. remains of CO leave the regenerator. 

First, the molar ratio a of CO 2 to CO in the dense bed is introduced as an 
empirical relation 



cr = 0.000953 exp(5585/T^e^) if Treg < 803 

cr = 1 + (Treg ~ 803) 0.00142 if 803 < Treg < 873 (16.117) 

cr = 1.1 + (Treg ~ 873) 0.0061 if 873 < %eg 

The heat of combustion of coke is equal to 

Ai7c6 = -Ai7i - Ai72 (Treg ~ 960) + 0.6 (Treg ~ 960)^ (16.118) 

The rate of coke combustion (kg/s) is equal to 

cb — kcb GXp( Eacb / (RTreg)) Ceoke,reg '^cat,reg (16.119) 



A first mass balance gives the derivative of coke on the catalyst 



dCeoke. 

dt 



reg 



Feat (C coke, sep Feoke,reg) cb 

Xricat,reg 



(16.120) 



and a second mass balance gives the derivative of mole fraction of O 2 in the 
dense bed 



dXQ^ 1 r F nir / {1 + cr)ncH +2 + 4cr Veb , 

7 . ^ A/T \p^02,in XO 2 ) A (T _i_ \ A/T ‘ 

kn i'^air,req ^ O' ) -^^J-w,coke 

(16.121) 

The energy balance gives the temperature variation 



dT 



reg 



dt 



Xricat,reg Fp^eat 



Feat Fp^eat Fgep T -F/ 



‘ Fp^air Fair 



(Feat Fp^cat T Fair Fp^air) Freg 



^cb ■ 



Tcb 



Fdw,coke 

(16.122) 

Thus, the FCC model is constituted by seven time-ordinary differential 
equations. The spatial equations describing the riser are integrated at each 
time step and the corresponding variables are then updated. 

The values of the different parameters necessary for model simulation are 
given in Table 16.1 and the steady- state values are given in Table 16.2. 

The manipulated inputs are the ffow rate of regenerated catalyst ui and the 
ffow rate of air 1 x 2 - The controlled outputs are the temperature at the outlet of 
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Table 16 . 1 : Parameters for FCC model 



Symbol 


Signihcance 


Value 


Feat 


Mass flow rate of catalyst (kg.s“^) 


294 


Ffeed 


Mass flow rate of oil in the feed (kg.s“^) 


40.63 


Fair 


Mass flow rate of air to regenerator (kg.s“^) 


25.378 


Tair 


Temperature of air to regenerator (K) 


360 


Tfeed 


Temperature of feed (K) 


434.63 


Tboil 


Boiling temperature of feed (K) 


700 




Heat of vaporization of oil (J.kg“^) 


1.56 X 10® 


Cp^ol 


Heat capacity of oil as a liquid (J.kg“^.K“^) 


2671 


Fp,og 


Heat capacity of oil as a gas (J.kg“^.K“^) 


3299 


Facf 


Activation energy for coke formation (J.moH^) 


41.79 X 10® 


'^prod 


Exponent in expression of coke produced 


0.4 


t 


Time (s) 




tc 


Catalyst time residence in riser (s) 


9.6 


Z 


Dimensionless height in the riser 




kc 


Rate constant for catalytic coke formation 


0.0176 


Fp^air 


Heat capacity of air (J.kg“^.K“^) 


1074 


Fp^cat 


Heat capacity of catalyst (J.kg“^.K“^) 


1005 


Fp^steam 


Heat capacity of dispersing steam (J.kg“^.K“^) 


1900 


A 


Weight fraction of steam in feed stream to riser 


0.035 


kio 


Reaction rate constant for cracking of gas oil 


9.65 X 10^ 


kso 


Reaction rate constant for cracking of gasoline 


4.22 X 10^ 


Faf 


Activation energy for gas oil cracking (J.moR^) 


101.5 X 10^ 


Fag 


Activation energy for gasoline cracking (J.moR^) 


112.6 X 10^ 


m 


Empirical deactivation parameter 


80 


a 


Catalyst decay rate constant (s“^) 


0.12 


0^2 


Fraction of the gas oil that cracks to gasoline 


0.75 


^F-p^erack 


Heat of reaction for gas oil cracking (J.kg“^) 


506.2 X 10^ 


keb 


Reaction rate constant for coke combustion 


2.077 X 10^ 


Facb 


Activation energy for coke combustion (J.moR^) 


158.59 X 10^ 


^cat,reg 


Holdup of solid in regenerator (kg) 


175738 


"bklcat,sep 


Holdup of solid in separator (kg) 


17500 


'^air,reg 


Holdup of air in regenerator (mol) 


20000 


ncH 


Number of hydrogen in coke of formula CH^ 


2 


^coke 


Molecular weight of coke (kg.moR^) 


14 X 10-3 


AHi 


Parameter in heat of reaction for coke combustion 


521.15 X 103 


AH2 


Parameter in heat of reaction for coke combustion 


245 



riser Tris{'^) and the temperature in the regenerator T^eg- The sampling time 
is taken to be equal to 250 s. 

To hnd the steady states, hrst a nonlinear optimization is realized to obtain 
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Table 16 . 2 : Steady- state values of main variables 



Symbol 


Significance 


Value 


T«s(0) 


Temperature in the riser at z = 0 (K) 


805.48 


Trisil) 


Temperature in the riser at z = 1 (K) 


780.68 


Treg 


Temperature in the dense bed of the regenerator (K) 


972 


Fc,reg 


Coke concentration in the regenerator (kg/kg) 


0.0038 


Cc,ris{'^) 


Coke concentration in the riser at z = 1 (kg/kg) 


0.01045 


X02 


Oxygen mole fraction in the regenerator 


0.0047 


Vgo 


Weight fraction of gas oil in the riser at z = 1 


0.4825 


Vg 


Weight fraction of gasoline in the riser at z = 1 


0.3680 



the values of the flow rate of air Fair^ the feed temperature and the rate 

constant for catalytic coke formation kc so that the regenerator steady-state 
balances are verified, kc depends on the feed oil composition. Then, a dynamic 
simulation based on these values is realized, which confirms the steady state. 



Open-Loop Responses 

Open-loop responses have been obtained for step variations of 5% of the mani- 
pulated inputs around the steady state (Figs. 16.4 and 16.5). 

For steps of the air flow rate, the responses are classical. However, for steps 
of the catalyst flow rate, the temperature at the riser exit first shows a sharp 
transition because of the direct influence of the concerned manipulated input 
on the studied output, then an inverse response. The regenerator temperature 
also shows an inverse response. 



Quadratic Dynamic Model Control of the FCC 

The model horizon is = 60, the prediction horizon Hp = 20 and the control 
horizon He = 3. To avoid the influence of the inverse responses, a minimum 
bound is imposed on the prediction equal to i^rnm = 10 so that the predicted 
outputs considered for optimization are between and Hp. The inputs 

and outputs considered in the quadratic criterion are normalized so that simple 
weights F and A can be considered. 

First case: the weights on the inputs and outputs are all equal to 1. Al- 
though this is the simplest set of weights which can be used, the outputs sat- 
isfactorily follow the step set points and the inputs vary smoothly (Fig. 16.6). 

Second case: the weights on the inputs are all equal to 1 and on the outputs 
all equal to 10. The tracking is improved with respect to the first case at the 
expense of larger, although quite acceptable, input variations (Fig. 16.7). 
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Figure 16.4: Variations of the temperature at the outlet of riser Tris{'^) and 
the temperature in the regenerator T^eg for air flow rate steps of 5% 
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Figure 16.5: Variations of the temperature at the outlet of riser Tris{'^) and 
the temperature in the regenerator T^eg for catalyst flow rate steps of 5% 



Observer-based Model Predictive Control of the FCC 

The observer-based model predictive control (OBMPC) described in Sect. 16.2.7 
has been applied and its performances can be compared to the QDMC strategy. 
An important difference is that in OBMPC, a linear Kalman hlter used as a 
state observer is implemented, thus simulated measurements are performed. It 
has been assumed that the temperatures at the riser exit and in the regenerator 
are random variables following the normal law with a given standard deviation. 
The noise is blank, i.e. its mean is zero. The model horizon is = 60, the 
prediction horizon Hp = 20 and the control horizon He = 3. The covariance 
matrix Q of the Kalman hlter is scalar and equal to the identity matrix. The 
covariance matrix R of the Kalman hlter is also scalar, with a factor equal 
to the variance of the measurements. The initial states have been taken to 
be equal to the steady-state outputs. Normalized weights T and A have been 
taken in the quadratic criterion. 

First case: to be able to make a comparison with QDMC, the standard 
deviation of the measurements has been taken to be equal to 0.01 so that the 
measurement noise is negligible. The resulting outputs obtained with OBMPC 
(Fig. 16.8) are very similar to those obtained with QDMC (Fig. 16.6) in 
similar conditions except for the presence of measurements. The weights on 
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Figure 16 . 6 : First case of quadratic dynamic matrix control of the FCC. Top: 
controlled outputs. Bottom: manipulated inputs 



the outputs were equal to 10 while they were 1 on the inputs. 

Second case: the standard deviation of the measurements has been taken 
to be equal to 0.5 so that the measurement noise is noticeable. The normalized 
weights on the outputs and inputs were taken to be equal to 1, in order to avoid 
brutal input moves. The resulting measured outputs obtained with OBMPC 
(Fig. 16.9) correctly follow the step set points in spite of the present noise. 
The corresponding inputs are acceptable and do not show sharp variations, in 
general, as expected from the chosen weights. 
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Part V 

Nonlinear Control 




Chapter 17 



Nonlinear Geometric 
Control 



Most chemical processes display nonlinear behaviour, whether they are chemi- 
cal or biological reactors, neutralization reactors and distillation columns. To 
control their operation around a steady-state point, a linear model, obtained by 
linearization or identihcation, is frequently used for designing the control law. 
In some cases of highly nonlinear behaviour or during the transient regimes 
such as startup and shutdown, in the case of batch or fed-batch processes, a 
linear controller is insufficient to guarantee the stability and performance. It is 
possible to use gain programming controllers, which need to test the controller 
tuning for all the operating domains crossed. Adaptive control itself is not 
always satisfactory and, in particular, can pose robustness problems. On the 
other hand, it is often possible to have at our disposal a nonlinear knowledge 
model of the process, which may be more or less accurate. 

The theory of linear system control is much more developed than that of 
nonlinear system control, but the progress realized in the nonlinear domain is 
such that the designer of a control system nowadays can hnd a certain number 
of efficient tools, provided that he or she possesses nonlinear knowledge model 
of the process. 

Among the different methods of nonlinear control, the theory related to 
differential geometry will be discussed in this chapter and the reader will be 
able to refer to the two applications of nonlinear geometric control concerning 
a chemical reactor and a biological reactor developed in Chap. 19. Other 
methods exist, but the proposed method is often the most employed, as it allows 
us to answer, in particular, questions of reachability, observability and feedback 
linearization. The problem is to algebraically transform the dynamics of a 
nonlinear system into partially or totally linear dynamics by a transformation 
acting on the states, which differs totally from the linear approximation of 
dynamics by calculation of the Jacobian. 
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17.1 Some Linear Notions Useful in Nonlinear 
Control 



The theory of nonlinear control is an extension of the theory developed origi- 
nally for linear systems (Wohnam, 1985). For this reason, a certain number of 
notions having common links with linear and nonlinear systems are discussed 
in a form which emphasizes the similarities between linear and nonlinear theory 
(Isidori, 1989; Khalil, 1996; Kravaris and Kantor, 1990a; Slotine and Li, 1991). 

Initially, we consider single-input single-output systems in the most general 
form 



( X = f{x,u) 

X y = h{x) 



(17.1) 



where x is the state vector of dimension n, u the control input and y the 
controlled output. In fact, in chemical engineering, most systems are affine 
with respect to the input 



X =f{x)+g{x)u 
y = h{x) 



(17.2) 



f{x) and g{x) are respectively called vector fields of the dynamics and the 
control. 

In the case of a linear system 



X = Ax -\- B u 

y =Cx 



(17.3) 



the vector fields are 

f{x) = Ax ; g{x) = B ; h{x) = Cx 
where A, B and C are matrices of respective dimensions n x n, n x 1 and 1 x n. 

17.1.1 Influence of a Coordinate Change in Linear 
Control 

The states x represent a set of coordinates which describe the time evolution 
of this system. The study of dynamic systems frequently requires a coordi- 
nate change. In the linear framework, a coordinate change is defined by a 
nonsingular similarity matrix T such that the new coordinates are 

C = Tx 

The linear system (17.3) is thus transformed into 

/ e =A^^Bu 
I y =c^ 

A = TAT~^ ; B = TB ; C = C T~^ 



(17.5) 

(17.6) 



with 



(17.7) 
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The two linear systems (17.3) and (17.6) have the same transfer function 



r(g) 

U{s) 



C{sl-A)-^B = 



CAd]{sI-A)B 
det(s I — A) 



(17.8) 



and represent two different realizations of the same input-output system. A 
series expansion of transfer function (17.8) gives 



(17.9) 



where the quantities C C A C B^ . . ., C A^ B^ ... , are called Markov 
parameters of the system, which are sufficient to completely determine the 
system transfer function. 



17.1.2 Relative Degree 

The degree of the denominator det{s I — A) of transfer function (17.8) is always 
equal to the system order n, but the degree of numerator C Adj(5/ — A) B is 
included between 0 and n — 1. The relative degree of linear system (17.3), also 
called relative order or characteristic index, is equal to the difference between 
the degrees of denominator and numerator of transfer function (17.8). 

Definition (Hirschorn, 1979): the relative degree of linear system (17.3) is 
defined by 

C A^ B = 0 for all: k<r — l . ^ s 

CA^-^B^O 

The relative degree r is the smallest integer such that: C A^~^ B ^ 0. The 
system (17.3) will thus have a relative degree r such that 

r = l if C B 

r = 2 if CB = d and CABj^O (17.11) 

r = 3 if C B = CAB = 0 and CA^B^d,... 

The relative degree, if it exists, is necessarily such that: 1 < r < n. 

The relative degree can also be considered from a different angle by taking 
into account the successive time derivatives of the output y 



dt 



C Ax 



dr-i 

dd'y 

dB 



CA^-^x 

CA^x + CA^-^Bu 



(17.12) 



by using the definition (17.10) of the relative degree. The relative degree is thus 
the smallest degree of differentiation of the output y which depends explicitly 
on the input u. 
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The design of a controller, e.g. in internal model control, often makes use 
of the inverse of the process transfer function. The inversion of a linear system 
consists of finding a state-space realization of the transfer function of the inverse 
of the model, that is 



U{s) _ det{sl -A) 

~ CAd:i{sI -A)B 



(17.13) 



The dynamic system 



BCA^ B (Fy 

^ ~ CA^-^B dt^ 

1 d^y CA^ 

C A'-^B dt'' ~ 



(17.14) 



is a state-space realization of the inverse (17.13) of system (17.3) assumed to 
be of relative degree r (Kravaris and Kantor, 1990a). However, this is not a 
minimal order realization of the inverse. Indeed, the order of dynamic system 
(17.14) is n, whereas the degree of transfer function (17.13) is n — r. This can 
also be verified by calculating the transfer function of (17.14): there are r pole 
simplifications at the origin. 



17.1.3 Normal Form and Relative Degree 

The vectors (C, C A^ . . ., C A^~^ are linearly independent. We assume h\ ^ 0. 
In these conditions, the transformation 

a 



a =cA^-^x 
^ _ a+i 

a+i — ^r+l ~~r 
bi 



(17.15) 



sn — 7 

bi 

is invertible and transforms system (17.3) into the following system 
^1 = ^2 



1 



= & 



ir 


= fii 


a+1 


+ 7i 


■ a ■ 






_ 







a +1 




a +1 




■ a ■ 




= A 




+ r 




_ in 




_ 










+ pu 



(17.16) 
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where A, P, ai, 71 are matrices of respective dimensions (n — r) x (n — r), 
(n — r) X r, 1 X (n — r), 1 x r, and /3 is a nonzero scalar. 

The dynamic system (17.16) is a different realization of transfer function 
(17.8) and constitutes a normal form of any linear system of relative degree r. 
A remarkable property of the realization (17.16) is that the (n — r) eigenvalues 
of matrix A are the zeros of transfer function (17.8) by noticing that the transfer 
function of system (17.16) is equal to 



C/(s) 



f3 det(s I — A) 



det(s I — A) 





■ 1 


\ 




■ 1 




s 






s 


s’' - 7i 






— di Adj(s I — A)r 






_ 5^-1 _ 






_ 5^-1 _ 



(17.17) 

From the normal form (17.16), it appears that the relative degree r is the 
number of integrators that the input u must cross before affecting the output 
y = The normal form also allows us to obtain a minimal order realization of 
the inverse system (17.13). By using the derivatives of the output, the system 
(17.16) becomes 



a 

Cr—l 



= y = ^2 

(F~^y 



dt^ 



dr 



f 



= ai 



a+i 




a+1 




= A 




_ in 




_ a 



^r+l 

+ r 



+ 7i 



y 

dt 



y 

dt 



L dr-i J 



+ f3u 



d"' ^y 

dt^-^ J 



which gives the control 



1 

0 









’ y 


\ 


d^'y 


a +1 










dt 




«i 

dr 


_ a 


-71 


d^-^y 










1 dt^-^ J 


> 



(17.18) 



(17.19) 
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In this form, only the states , . . . , intervene, and they are denoted 
by (zi, . . . , Zn-r) by subscript translation. It is possible to deduce a minimal 
realization of the inverse system (17.13) 







' 




■ y 




= A 




+ r 


dt 


Zfi — r 




Zfi — r 







u 



1 


/ 

d'^y ~ 


' 




— a\ 

dt'' 






< 


Zn — r 





■ y 










7i 


dt 






d^~^y /dt^~^ 





(17.20) 



17.1.4 Zero Dynamics 

The zeros of a linear system are the roots of the numerator polynomial of 
transfer function (17.8) and can be considered as the poles of its inverse (17.13). 
As the realization (17.20) of the inverse is minimal, the zeros of transfer function 
(17.8) are the roots of det(5/— A) = 0. The zero dynamics of a system is defined 
as the dynamics of its inverse. Thus, the dynamics of the last (n — r) equations 
of the normal form (17.16) completely determines the zeros of the system, i.e. 







' 




' y 

dy/dt 


Zfi — r 


= A 


Zfi — 7 - 


+ r 


dd'~^y/df'~^ 



The dynamic system (17.21) written as 







' 




' Ui 




= A 




+ r 




Zfi — f> 




Zfi — f^ 




Uf' 



(17.21) 



(17.22) 



is called the forced dynamics of system (17.3), while the dynamic system 







' 


Zfi — f^ 


= A 


Zfi — f^ 



(17.23) 



is called the unforced dynamics or internal dynamics of system (17.3), or zero 
dynamics of system (17.3). Actually, the system (17.23) completely determines 
the zero dynamics. The stability of the internal dynamics is thus determined 
by the positions of the zeros. 
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Figure 17.1: Static state feedback control 



17.1.5 Static State Feedback 



Consider the linear system (17.3) subjected to the static state feedback control 



u = Xv — Kx (17.24) 

where v is an external scalar input (Fig. 17.1), A a scalar gain for external 
input precompensation and K a state feedback gain vector of dimension n. 
The resulting closed-loop system is 



r X = {A — B K)x -\- X B V 
[ y =Cx 



(17.25) 



The static state feedback (17.24) presents the following properties: 

— It preserves the linearity, the relative degree and the zeros of the system. 

— It modihes the poles of the system (which is the aim of the state feedback). 
A static state feedback control law can be dehned for the system (17.3) of 
relative degree r. The order r derivative of the output y of system (17.3) is 
equal to 

y(r) = C A^x + C Bu (17.26) 

as C A~^~^ B 0. The static state feedback control law results 



V — C X 

CA^-^B 



(17.27) 



with V = . Note that the static state feedback control law (17.27) is a 

minimal order realization of the inverse of system (17.3). The linear system 
(17.3) subjected to the static state feedback (17.27) becomes 



X = {A — B 
y = C X 



CA^ 



CA^-^B 



)x ^ B 



CA^-^B 



(17.28) 



17.1.6 Pole-Placement by Static State Feedback 

Consider a linear system in its controllable canonical form. The associated 
dynamic system is 

±1 = X2 



Xn—1 — Xji 

±n = —AnXi — an-lX2 - ... - a\Xn + bu 



(17.29) 
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and its characteristic polynomial is 

^ + . . . + -\- OLfi (17.30) 

Suppose that we desire a static state feedback that places the poles such that 
the characteristic polynomial becomes 

+ OLlS^ ^ + . . . + OLrf^—iS + OLfi (17.31) 

The static state feedback realizing this pole-placement is equal to 

u = - {v — [an — • • • a\ — ai] x) (17.32) 

b 

For any controllable linear system such as 

x = Ax^Bu (17.33) 

the previous result is easily generalized by considering the transformation T 
such that 

" Q 
QA 

^ = T X = . X (17.34) 

_ QA^-^ _ 

where Q is a row vector satisfying 

QB = 0 
QAB = 0 

: (17.35) 

QA^-^B =0 
QA^-^B 7^0 

This transformation transforms the system (17.33) into 

ii = 6 

• (17.36) 

Cn—l — Cn 

in =QA^T-^C^QA^-^Bu 

From Eq. (17.32), the static state feedback realizing the pole-placement (17.31) 
is thus equal to 

U = “ ([“« ■■■ ai] + QA^ T~^) T x) (17.37) 

which can be transformed, according to Ackermann’s formula 

^ ~ Q A^~^ B ~ O'lQ A^ ^ + . . . + an-i Q A^ anQ)x) (17.38) 

Among the possible vectors Q, the following vector is suitable 
Q = last row of [B AB A^~^B 
with, moreover, QA^~^B = 1. 



(17.39) 
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17.1.7 Input-Output Pole-Placement 



Consider the linear system (17.3) assumed of relative degree r. We seek a state 
feedback which imposes a given transfer function for the closed-loop system. 
The static state feedback preserves the relative degree of the system, thus the 
simplest closed-loop transfer function can be written as 



r(g) 1 

V (s) s'^ -\- + . . . + f3-p—iS -\- (3r 



(17.40) 



The static state feedback which, when applied to the system (17.3), gives trans- 
fer function (17.40), is equal to 

^ ~ ~ (^^^ T PiCA^ ^ + . . . + (3r-iCA + f3rC) x] (17.41) 

By identihcation, the general static state feedback control law (17.24) gives 



CA^ (17.42) 

K (CA^ ^ (3iCA^ ^ ^ ^ (3r-lCA^ (3rC) 

The corresponding closed-loop characteristic polynomial is equal to 

dei{sI-A + BK) = — -4— CAdj(s/-A)S(s’'+/3is’'-i+...+/3^_is + /3^) 
Cy A^ ^ B 

(17.43) 

According to this expression, we notice that the state feedback (17.41) places 
the closed-loop poles at the process zeros and the desired poles specihed by Eq. 
(17.40). The closed-loop system can thus only be stable (internal stability of 
the states) if its zeros have a negative real part, thus if the system is minimum- 
phase. 



17.2 Monovariable Nonlinear Control 



17.2.1 Some Notions of Differential Geometry 

Several textbooks have been published recently in the domain of nonlinear 
control, in particular the following: Fossard and Normand-Cyrot (1993); Isidori 
(1989, 1995); Nijmeijer and VanderSchaft (1990); Slotine and Li (1991); Vidyasagar 
(1993). In order to master the concepts which will be used in this chapter, some 
elementary notions of differential geometry must be introduced. 

We consider nonlinear systems, affine with respect to the input, such as 



( X = f{x) +g{x)u 

\ y = h(x) 



(17.44) 



where /(x), g{x) are smooth mappings and h{x) a smooth function^. f(x) and 
g{x) are frequently called vector fields. 

^ A function or a mapping is said to be smooth if it possesses continuous partial derivatives 
of any order; mathematically the function is . 
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Given two real vectors V and W, the scalar product of the two vectors is 
denoted by 

n 

<V,W >=V'^W = '^ViWi (17.45) 

i=l 



We use the following notation for the gradient vector (taken as a row vector) 
of a function A 



DX{x) 



dX 

dx 



dX dX 

dxi dXn 



(17.46) 



The gradient (as a row vector) of a function A is thus the Jacobian matrix of 

A. 

The vector fields ^i(x), . . . ,^^(x) are linearly independent if, for all x, the 
vectors ^i(x), . . . , gm{x) are linearly independent. 

The scalar fields zi(x), . . . , Zm{x) are linearly independent if, for all x, their 
gradients Dz\{x )^ . . . , Dzm{x) are linearly independent vector fields. 

The derivative of a function A(x) in the direction of the field / (directional 
derivative) is defined by 



^ M^) =< /(^) > 



i=l 



dx 



(17.47) 



This derivative is called the Lie derivative and plays a very important role in 
nonlinear control. As a matter of fact, for the system (17.2) 



dt dxi dt 

dJi 

= J27=i-^,iM^)+9i{x) u) 

= Lfh{x) + Lgh{x) u 



(17.48) 



thus the time derivative of the output is simply expressed with respect to the 
Lie derivatives. The Lie derivative LfX{x) is the derivative of A along the 
integral curves^ of the vector field /. 

It is possible to realize successive differentiations, such as the differentiation 
of A in the direction of /, then in the direction of that is 

LgLf\{x) = ^^g{x) (17.49) 

or further, to differentiate A, k times in the direction of / 

dL^~^X 

LjA(x) = — ^ f{x) with: LjA(x) = X{x) (17.50) 

^ Given the state-space system 

x{t) = f{x{t)) ; a?(0) = 



the solution x{t) passing by x° defines a curve called the integral curve. 




Process Control 



629 



The Lie bracket is defined by 






(17.51) 



bian matrix of g) 



nf( \ 

= s = 



of / equal to (same 


- m 


^ ■ 


dxi 


dXn 


% 


% 


_ dxi 


dXn . 



(17.52) 



It is possible to repeat the operation on the Lie bracket of g by iterating 
on /, but in this case, rather than denoting by [/, [/,...,[/, ^]]]], the following 
notation is used 



adj^(x) = [/, adj ^g]{x) for k>l with: adj^(x) = g{x) (17.53) 

The Lie bracket is a bilinear, skew- symmetric mapping and satisfies the Jacobi 
identity 

[/, [g, p\] + [g, [p, /]] + [p, [f, 5]] = 0 (17.54) 

where f^g^p are vector fields. 



17.2.2 Relative Degree of a Monovariable Nonlinear 
System 

Consider the single-input single-output nonlinear system (17.2). The relative 
degree, or relative order, or characteristic index, is defined (Hirschorn, 1979) 
by: 

Definition: the relative degree of the nonlinear system (17.2) over a domain U 
is the smallest integer r for which 

LgU'^~^h{x) ^ 0 forallxinf/ (17.55) 

This relation is consistent with the definition of the relative degree for linear 
systems by noticing that, for a linear system, Eq. (17.55) becomes 

LgLy^h{x) = CA^-^B ^ 0 (17.56) 

The nonlinear system (17.2) will thus admit a relative degree r equal to 



r = 1 
r = 2 
r = 3 



if Lgh{x) 7^ 0 

if Lgh{x) = 0 and LgLfh{x) 7^ 0 

if Lgh{x) = LgLfh{x) = 0 and LgL‘j^h{x) 7^ 0 



(17.57) 
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Isidori (1995) defines the relative degree at a point x° if the definition (17.55) 
is valid at any point x of the neighbourhood U ; however, the first Lie derivative 
Lgh{x) of the sequence LgL^~^h{x) can be accidentally zero at x°. In this case, 
the relative degree cannot be defined strictly at but will be defined in the 
neighbourhood U (notion of dense open subset). This point will be present in 
the following, although it will not be systematically recalled. 

The condition (17.55) can be transformed, by use of the Leibnitz formula 
(Kravaris and Kantor, 1990a), and formulated as: the relative degree of the 
nonlinear system (17.2) is the smallest integer r such that 

(17.58) 

As in the linear case, the interpretation of the relative degree r can be 
obtained from the time derivatives of the output y 

= Lfh{x) + Lgh{x) u 
= Lfh{x) if 1 < r 

(17.59) 

= L)h{x) + LgLy^h{x)u 
= Lfh{x) ii k < r 

= Ufh{x) + LgLy^h{x) u as LgV'f'^h{x) ^ 0 

The relative degree is equal to the number of differentiations of the output y 
that are necessary to make the input u explicitly appear. 

If we obtain 



dt 



d^y 

dt^ 

dV 



LgL^h{x) = 0 for all /c , for all x in [/ (17.60) 

the relative degree cannot be defined in the neighbourhood of x° and the output 
is not touched by the input u. 

The row vectors Dh{x)^ DLfh{x )^ . . . , DU'^~^h{x) are linearly independent 
(Isidori, 1995). This can be proved by showing that the matrix 



Dh{x) 

DLfh{x) 

DLy^h{x) 



g{x) d.dfg{x) 



adj ^g{x) 



(17.61) 



has rank r. This is an important point, as the r functions /i(x), Lfh{x)^ . . ., 
U'j~^h{x) can form a new set of coordinates in the neighbourhood of point 



17.2.3 Probenius Theorem 

The Frobenius theorem is related to the solving of a first-order partial differen- 
tial equation: 
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a) First, suppose that we have d smooth vector helds fi{x)^ dehned on 
which span a distribution^ A, denoted by 

A = span{/i(x),...,/d(x)} (17.62) 

In the same neighbourhood, the codistribution H, of dimension n — d is spanned 
by n — d covector helds: cji, . . . , uOn-d such that 



< cjj(x), fi{x) >= 0 Vl<i<d,l<j<n — d (17.63) 

Because of this property, we denote the codistribution: H = A^, and coj is the 
solution of the equation 

ujj{x)F{x) = 0 (17.64) 

where F{x) is the matrix of dimension n x d, of rank d, equal to 

F{x)=[fi{x) ... fd{x) ] (17.65) 



The row vectors coj form a basis of the space of the solutions of Eq. (17.64). 

b) We assume that we are looking for solutions such that 



dx 



(17.66) 



corresponding to smooth functions Aj, i.e. we are looking for n — d independent 
solutions (the row vectors dXj jdxi^ . . . ^ dXj jdxn-d are independent) of the 
following differential equation 



BX BX 

= -^ [ /i(a^) ■■■ fd{x)]=0 (17.67) 



c) We seek the condition of existence of n — d independent solutions of dif- 
ferential Eq. (17.67), which is equivalent to seeking the integrability of the 
distribution A: a distribution of dimension d, defined on an open domain U of 
R^, is completely integrable if, for all point x° of [/, there exist n — d smooth 
functions, having real values, defined on a neighbourhood of x°, such that 



The condition of existence is provided by the Frobenius theorem. 

d) Frobenius theorem: a distribution is nonsingular if and only if it is involu- 
tive^ . 

In the case where F is reduced to only a vector field /i, Eq. (17.67) can be 
geometrically interpreted as: 

^The smooth vector fields fi (x), . . . , fd{x) span at a point x oiU a vector space dependent 
on X that can be denoted by A(x). The mapping assigning this vector space to any point x 
is called a smooth distribution. 

distribution A is involutive if the Lie bracket of any couple of vector fields belonging 
to A belongs to A 

fi and /2 e A [/i, /2] G A 
[fiJj]{x) = Yl'k=i(^ijkfk{x) vf,j 
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— The gradient of A is orthogonal to /i . 

— The vector /i is tangent to the surface A = constant passing by this point. 

— The integral curve of /i passing by this point is entirely on the surface 
A = constant. 



17.2.4 Coordinates Change 

A function ^ of in R^, defined in a domain [/, is called a diffeomorphism 
if it is smooth and if its inverse exists and is smooth. If the domain U is 
the whole space, the diffeomorphism is global; otherwise, it is local. The diffeo- 
morphism is thus a nonlinear coordinate change that possesses the properties 
previously listed. 

Consider a function ^ defined in a domain U of R^. If and only if the 
Jacobian matrix d^/dx is nonsingular at x° belonging to ^2, ^(x) defines a 
local diffeomorphism on a sub-domain of ^2. 

A diffeomorphism allows us to transform a nonlinear system into another 
nonlinear system defined with regard to new states. 

Given the single-input single-output nonlinear system (17.2) of relative de- 
gree r at set 



(^i(x) = h{x) 

(t) 2 {x) = Lfh{x) 

(17.69) 



If r < n, it is possible to find n — r functions (^^+i(x), . . . , (^n(^) such that the 
mapping 



^(x) 



(f)l{x) 



(17.70) 



has its Jacobian matrix nonsingular and thus constitutes a possible coordinate 
change at 

The value of the functions (^^+i(x), . . . , (j)n{x) at has no importance and 
these functions can be chosen such that 



< D(j)i{x)^ g{x) >= Lg(j)i{x) = 0 for all r + 1 < i < n for all x inQ (17.71) 
The demonstration makes use of the Frobenius theorem (Isidori, 1995). 
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17.2.5 Normal Form 

The nonlinear system (17.2) can be described in the new coordinates z* = 
[y,y ,. . = 0i(x), (i = 1, . . . , n) from the relations (17.70) 



= Lfh{x{t)) = (f>2{x{t)) = Z2{t) 

^ = Ly^h{x{t)) = (prixit)) = Zr{t) 
' _ I T rr-l 7 /_//\\ _//\ 



dzi 


_ d<pi 


dx 


dh dx 


dt 


dx 


dt 


dx dt 


dZr-l 


d(j)r- 


-1 dx 


_ dLy 


dt 


dx 


dt 


dx 


dZr 


d(j)r 


dx 


dUf^h 


dt 


dx 


dt 


dx 



(17.72) 

The expression of Zr{t) must be transformed with respect to z{t) by using the 
relation x{t) = which gives 

H 7 

= U^h{^-\z(t))) + L,Uf^h{^-\z(t))) u(t) 

= b{z(t)) + a{z(t)) u(t) 

by setting 

a{z{t))=L,Ly^h{^-\z{t))) ; b{z{t)) = L}h{^-\z{t))) (17.74) 

and by noting that, by dehnition of the relative degree, at = ^(x°), we have: 
a{z°) 7 ^ 0. 

It is possible to choose the following coordinates r < i < n, according 
to Eq. (17.71) so that Lg(j)i{x) = 0, which gives 

dzi d<pi dx d(pi 

= Lf(pi{x{t)) + Lg(pi{x{t))u{t)) = Lf4>i(x{t)) 

= LfU^-\z{t))) 

We set 

qi{z{t)) = Lf(j)i{^~^{z{t))) ; r<i<n (17.76) 

By again considering all the equations, we obtain the normal form 



= ^2 

^r-1 = (17.77) 

Zr = b{z) + a{z) u{t) 

Zr+l =qr+l{z) 

Zn = qn{z) 

to which we must add the equation of the output 



y = h{x) = Zi 



(17.78) 
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Figure 17.2: Description of the normal form 



This result can be symbolized in the block diagram (Fig. 17.2) by displaying 
the chain of r integrators necessary to pass from the control input to the output. 
The condition Lg(j)i{x) = 0, which is fundamental for seeking the functions 
r < i < n, can be difficult to fill, as this condition corresponds to the 
solving of a system of n — r partial differential equations. To define a coordinate 
change, it can be sufficient to find these functions so that the matrix is simply 
nonsingular. 

17.2.6 Controllability and Observability 

Consider the nonlinear system 



X = f{x) + g{x) u 



(17.79) 



defined in a domain U. 

The system (17.79) is controllable if, given an initial state whatever two 
points xi and X 2 of [/, there exists an admissible input u such that the system 
passes from the state xi to the state X 2 in a finite time T. 

The controllability of this nonlinear system can be studied by proceeding 
to a linearization of the system 



df 

z=^z-\-g(x)v (17.80) 

ox 

and by studying the controllability matrix. However, this approach is not al- 
ways satisfactory; indeed, a nonlinear system can be controllable whereas its 
linear approximation is not. It is necessary to introduce the notion of reacha- 
bility (Isidori, 1995; Nijmeijer and VanderSchaft, 1990). 

For observability that also requires complex topological notions, both pre- 
vious books are recommended. Observability can be defined in an approached 
manner by: 

A system is observable if, given two different initial conditions x(0) and x'(0), 
there exists a control input u(t) defined in [0,T] such that the corresponding 
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outputs y{x^ u, t) and y'{x^ u, t) are not identically equal in [0, T] . The input u{t) 
distinguishes the initial conditions x(0) and x'(0) in [0, T]. If u{t) distinguishes 
any pair {x^x') in [0,T], the input u{t) is universal. 

17.2.7 Principle of Feedback Linearization 

The control law depends on the state values, which are assumed to be known. 
In the case where all the states are not known, it is necessary to couple a state 
estimator, called an observer, to the control system. When the state feedback 
control law depends only on the values of the states x and the external input 
n, it is a static state feedback. If the control law corresponds to the output of 
a dynamic system, itself depending on the states x and on the external input 
n, it is a dynamic state feedback. 

Consider the single-input single-output nonlinear system affine with respect 
to the input 

X = f{,x) + g{x)u (17.81) 

dehned in a neighbourhood U of and such that: /(x°) = 0. The problem of 
feedback linearization is to hnd smooth functions p and q with q{x^) ^ 0, and 
a diffeomorphism ^ with = 0 such that by dehning: 

— an external input v = p{x) + q{x) n, 

— the transformed variables z = ^(x), 
the resulting system is linear as 

z = Az^Bv (17.82) 

where the pair (A, B) is controllable. The new state z is called a linearizing 
state and the control law is a linearizing control law. 




Figure 17.3: Linearizing feedback 

With regard to the state x, the control law (Fig. 17.3) is 

u(t) = H — ^ = a(x) + (3(x) V (17.83) 

q{x) q{x) 

17.2.8 Exact Input-State Linearization for a System of 
Relative Degree Equal to n 

This linearization is often called exact linearization (Isidori, 1995). To start, 
consider the system (17.81) of relative degree r = n, thus equal to the dimension 
of the state vector, at a point 
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The coordinate change necessary to get the normal form is then 






and the normal form is 



■ 01 (x) ■ 




h{x) 


0n 




Ly^h{x) 



(17.84) 



= ^2 



• (17.85) 

— 1 — 

Zn = b{z) + a{z) u{t) 



with a(z°) 7^ 0 because of the definition of the relative degree. 



V = Zn 



/ 




Figure 17.4: Exactly linearized system by static state feedback 



If we choose the state feedback control law 

/ N b{z) 7 
u{t) = — H- + 



a{z) a{z) 

we obtain the resulting closed-loop system (Fig. 17.3) 



(17.86) 



= ^2 



• (17.87) 

— 1 ~ 

Zn =V 



which is a linear and controllable system expressed in the Brunovsky canonical 
form 



" 0 


1 


0 


.. O' 




■ 0 ■ 


0 


0 


1 


•. 0 






_ 0 






1 

.. 0 _ 




0 

_ 1 _ 



(17.88) 



To obtain the control law (17.86), a coordinate change and a state feedback 
which can be exchanged are used. If we first use the state feedback and then 
the coordinate change, we obtain the following control law 



5(4>(x)) ^ V 

~a($(x)) a($(x)) 

—Ujh{x) + V 

LgLY^h{x) 



(17.89) 
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which corresponds to the same controllable linear system (17.87). This control 
law is called linearizing state feedback and the coordinates ^ are the linearizing 
coordinates. 

Two remarks (Isidori, 1995) are particularly important: 

— We assumed that x° is a stationary point for the system (17.2), thus f{x°) = 
0 and h{x°) = 0, hence 



(f)i{x°) = h{x°) = 0 

dLi~^h 

4>i{x°) = — I /(x°)=0 , Ki<n 

OX 



(17.90) 



or =0. It is always possible to come back to h{x^) = 0 by an appropriate 
translation. 

— It is possible to realize a pole-placement or to satisfy an optimality criterion, 
by imposing a feedback (Fig. 17.5) as 



V 2 = K z , with the gain vector: iC = [cq . . . c^-i] (17.91) 

equivalent to 

V2 = Coh(x) + ciLfh{x) + . . . + Cn-iL'^f~^h{x) (17.92) 

which is a nonlinear state feedback with respect to x. The state feedback 
control law becomes, in this case. 



-L^h{x) + Zn -L^h{x) - JZi=o CiL)h{x) + v 
LgLy^h{x) ~ LgLy^h{x) 



(17.93) 



The external input 'C, for example, could be equal to the set point ^re/- 
When 'c = 0, this corresponds to the local asymptotic equilibrium z = 0 
which is preserved. 




Figure 17.5: Nonlinear control with pole-placement for a system of relative 
degree equal to n 

By explaining the transfer function T(5)/F(5), the characteristic polyno- 
mial is then equal to 



Co T Cl 5 + . . . + Cji—i ^ + 5^ (17.94) 



whose coefficients can be chosen so as to realize the adequate pole-placement. 
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By simply considering the system (17.81) without output, Isidori (1995) 
shows that the exact input-state linearization is possible in a neighbourhood 
U of if and only if there exists a scalar function A(x) such that the system 
with the “output” redefined 



X = f{x) + g{x) u 

y = A(x) 



(17.95) 



has a relative degree equal to n at The function A(x) is equal to z\{x). 

It is possible to obtain the following general theorem (Hunt et ah, 1983; Su, 
1982): 

Theorem: 

the system (17.81) is exactly linearizable in state space (input-state lineariza- 
tion) in a neighbourhood U of if and only if the following conditions are 
satisfied: 

1. The vector fields {^(x°), ad/^(x°), . . . , adj~^^(x°)} are linearly indepen- 
dent. 

2. The distribution span{^, adjg ^ . . . , a(TJ~^g} is involutive in U. 

Condition 1 that can be written as “the following matrix 



[g{x°),adfg{x°),...,ad^ ^g{x°) 

has rank n” is a controllability condition of the nonlinear system. This matrix 
must be invertible. In the linear context, this matrix is the controllability 
matrix 



To realize the exact input-state linearization, we must proceed according to 
the following stages: 

• Build the vector fields g{x°)^ adfg{x °)^ . . . , ad'^~^g{x°). 

• Check if the conditions of controllability and involutivity are verified. 

• If these conditions are verified, find the function A(x) from the equations 

Lg\{x°) = LgLf\{x°) = ... = LgLy^\{x°) = 0 
LgL}-^\{x°) + 0 

• Calculate the coordinate change 



^{x) = [A(x), L/A(x), . . . , Ly^X{x)] (17.96) 



17.2.9 Input-Output Linearization of a System with 
Relative Degree r Lower than or Equal to n 

Two cases are distinguished: 

• (Isidori, 1995) notices that a nonlinear system such as 

( X = f{x) + g{x) u 

\ y = h(x) 



(17.97) 
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having a relative degree r < n can satisfy the conditions of the previous 
theorem. In this case, there exists a different “output” A such that the 
system has a relative degree equal to n. The new system thus defined 
satisfies the previous theorem; by using a feedback u = a{x) -\- (3{x) v and a 
coordinate change ^(x), it is transformed into a controllable linear system, 
but the real output, in general, is not linear with respect to the new 

y = h{^-\z)) (17.98) 



• If the output y is fixed by ^ = h{x) and if the system possesses a relative 
degree r lower than or equal to n, by using the coordinate change ^(x), it 
is possible to transform the system into the normal form (17.77) and to set 
V = Zr so that the system is simply expressed in the transformed coordinates 
in Byrnes-Isidori canonical form 





= Z2 


ir-1 


= Zr 


Zr 


= V = b{z 


^r+1 


= qr+l{z) 


Zn 


= Quiz) 


y = zi 





with 

b{z) = L'fh{x) , a{z) = LgU'^~^h{x) 
The control law is deduced 



u{t) 



b{z) ^ 

a{z) a{z) 
—U^h{x) + V 

LgUf^h{x) 



(17.99) 



(17.100) 



(17.101) 



The resulting system is only partially linear, but the output is infiuenced 
by the external input v only through a chain of r integrators (Fig. 17.2) 
related to the new states , . . . , 



= L^j^h{x) ^ LgL^j^ ^h{x)u = v (17.102) 

The new states Zr+i, • • • , which constitute the nonlinear part of the sys- 
tem do not infiuence the output y. 



17.2.10 Zero Dynamics 

In the case of a linear system of relative degree r = the transfer function does 
not possess any zeros. Similarly, for a nonlinear system, for that purpose, we 
consider only the case where the relative degree r is lower than n. We represent 
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the vector in normal form (17.77) by separating the linear part of dimension r 
and the nonlinear part of dimension n — r, thus 



' 




' y 


^2 




y 








Zr 




_ y(r-l) _ 



allowing us to rewrite the system as 



7 ] = 



^r+l 



(17.103) 



= ^2 



ir = b{^, 7]) + a{^, 7]) u{t) 

V =q{^,v) 



(17.104) 



where ^ and 77 constitute the normal coordinates or normal states. 

The dynamics of the nonlinear system is thus decomposed into an external 
input-output part and an internal unobservable part. It is simple to conceive 
the external part, but there remains the problem of the internal stability cor- 
responding to the last (n — r) equations: f] = 77 ). 

As x° is an equilibrium point of the system, we obtain f{x°) = 0 and we 
can choose h{x°) = 0. We can assume that, in the normal coordinates, the 
point ( 0 , 0 ) is the equilibrium point, hence 6 ( 0 , 0 ) = 0 and q{0, 0 ) = 0 . 

We seek to make the output zero for all t in the neighbourhood of t = 0. In 
the normal form, this would amount to imposing 



zi = . . . = Zr = 0 = 0 for all t (17.105) 

as, moreover, we maintain y = z\ =0. The input u results such that 



0 = 6 ( 0 , 77 ) + a( 0 , 77 ) u{t) 



(17.106) 



with a(0, 77 ) 7 ^ 0 still in the neighbourhood of 6 = 0. Moreover, the variable 77 
is such that 

77 = g'( 0 , 77 ) , with: 77 ( 0 ) = 77 ° (17.107) 

which is an autonomous system of differential equations whose solution is the 
variable 77 ( 6 ). We deduce the unique expression of the input that imposes a 
zero output in the neighbourhood of 6 = 0 



b{0,7]{t)) 









a{0,ri{t)) 



(17.108) 



The dynamics of Eq. (17.107), which results from the condition of zero output, 
is called zero dynamics or unforced zero dynamics; it describes the internal 
behaviour of the system. 
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The search of the zero output could have been realized in the original state 
space, by setting 

y{t) = y{t) = . . . = {t) = y^^^ {t) = 0 for all t (17.109) 



or further, in the neighbourhood of 

h{x) = Lfh{x) = . . . = U'^~^h{x) = 0 
U^h{x) + LgLy^h{x) u(t) = 0 



(17.110) 



The case of tracking a reference output is deducted from the previous 
case of a zero output. We set in the neighbourhood of t = 0 



y{t) = yref(^) 

which gives, for the new coordinates 
By analogy with the previous case, we set 





' 






*^ref 


^2 


= 


^ref 




Zr 




(r-l) 

- ^ref 



The equation that imposes the control results 

where r] is the solution of the autonomous differential system 



(17.111) 



(17.112) 



(17.113) 



(17.114) 



V{t) = q{^j;ei{t),v{t)) , with: rj{Q) = rj° (17.115) 

We draw the equation of the unique control imposing on the output to exactly 
follow the reference 



_ 4ef(^) “ ^(^ref(^)’^(^)) 



(17.116) 



The system of differential Eqs. (17.115) coupled with Eq. (17.116) gives the 
forced zero dynamics or dynamics of the inverse of the system (17.97), corres- 
ponding to a control such that the output exactly follows the reference. 77 is 
the state of the dynamics of the inverse, its control input and u its output. 
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17.2.11 Asymptotic Stability 

We consider the system in its normal form 



= ^2 






(17.117) 



by considering as previously that ((f , rj) = (0, 0) is an equilibrium point. We 
consider a state feedback close to Eq. (17.93), in this case 



V = —Kz , with the gain vector: K = [cq . . (17.118) 

The state feedback 



u{t) 



v) - E[=o CjZj+i 

a{^,ri) 

-Ufh{x) - CjUfh{x) 

LgLy^h{x) 



gives the closed-loop system 



h =q{^,v) 

where A is equal to 

"0 1 0 ... 0 

: 1 ; 

: 0 
0 ... 0 1 

—Co —Cl — Cr-1 



(17.119) 



(17.120) 



(17.121) 



which is a companion controllability matrix and has the characteristic polyno- 
mial 

Co + Cl 5 + . . . + Cr-1 s'^~^ + s'^ (17.122) 

If on one hand, the coefficients are chosen so that the roots of this polynomial 
have a negative real part and, on the other hand, the zero dynamics correspon- 
ding to t) = q{0, Tj) is asymptotically locally stable, the state feedback (17.119) 
stabilizes asymptotically locally the system (17.120) in the neighbourhood of 
the equilibrium (<f , 77 ) = (0,0). 

The role and the importance of zero dynamics thus appear clearly here. 
If the linear approximation of the system possesses uncontrollable modes, the 
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latter necessarily correspond to eigenvalues of the linear approximation Q of 
the zero dynamics. The linear approximation of the system is given by 



= ^2 



Zr = S T] K U 

f] = P^^Qt] 



(17.123) 



with the partial derivative matrices considered at (<f , 77 ) = (0, 0) 



'db 


, s = 


'db 


, P = 


dq 


, Q = 


dq 


M. 




dr] 




M. 


drj 



(17.124) 



Notice that it is not necessary that the linear approximation is asymptotically 
stable for the nonlinear system to be stable. 




Figure 17.6: Nonlinear control with pole-placement for a system of relative 
degree r < n 



As has already been realized for a system of relative degree n with the state 
feedback (17.93), we can take into account an external input v (Fig. 17.6) as 



u{t) 



-Ufh{x) - CiL}h{x) + V 

LgLph{x) 



(17.125) 



so that the system (17.117) is transformed into 



^ =A^ + Bv 

V =q{^,v) 



(17.126) 



with: S = [0 . . .01]^. Provided that the zero dynamics is stable, the stability 
will depend on the characteristic polynomial 



Co Cl S Cr-l s'^ ^ s'^ (17.127) 

whose coefficients will be chosen so as to realize the desired pole-placement. 

By analogy with the linear systems, a nonlinear system is called minimum- 
phase if its unforced zero dynamics is asymptotically locally stable at (0, 0). 
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17.2.12 Tracking of a Reference Trajectory 



So that the output y = z\ converges asymptotically towards a reference trajec- 
tory it is suitable to gather the elements of both previous sections: the 
forced zero dynamics with reference trajectory and the asymptotic stability. 

We still consider the system in its normal form (17.117), and the state 
feedback is 



u{t) 



a(^,r]) 

-Ufh{x) + - X:[=o Ci{L)h{x) - 

LgUf^h{x) 



The error defined by 



e(^) = y{t) - yref (^) 

is the solution of the following differential equation 

+ Cr-i + . . . + Cl + cq e = 0 



(17.128) 



(17.129) 

(17.130) 



whose parallel with the characteristic polynomial (17.127) is obvious. By choo- 
sing in an adequate manner the coefficients c^, it is possible to ensure the 
exponential convergence of the error towards 0 when t ^ oo. In the same way 
as the previous section, it is necessary that the zero dynamics (here forced) 
corresponding to 



?7 = «(^ref’^) ’ with: 77j.gf(0) = 0 (17.131) 



is stable (where (^) is the solution of this differential system) and with the 
vector 

^ref = [yref(^)’ ’ 4ef (17.132) 

Isidori (1995) studied the particular case where the reference depends 
on a linear model. In the case where an external input is taken into account, 
the state feedback (17.128) becomes 



u{t) 



V - Ufh{x) + - E[=o Cj{L)h{x) - 

LgUf^h{x) 



(17.133) 



17.2.13 Decoupling with Respect to a Disturbance 

We assume that a modelable disturbance d acts on the system thus reformulated 



J X = f{x) g{x) u w{x) d 
\ y = h{x) 



(17.134) 



We wish to define an input u by static state feedback such that the output y 
does not depend (is decoupled) on the disturbance d. Express the system with 
the normal coordinates 



zi = Lfh{x{t)) + Lgh{x{t)) u{t) + Lujh{x{t)) d{t) 



(17.135) 
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We use, on one hand, the relative degree r of the system, which gives Lgh = 
0 , if r > 1, and we impose L^h = 0 so that the disturbance has no influence. 
For the following coordinates, a similar condition 

L^Vf^h = 0 , 1 <i<r (17.136) 

is used, hence the system in the normal form including the influence of the 
disturbance from rank r + 1 

= ^2 



i,_i = z, (17.137) 

ir = Ujh{x{t)) + LgV'^~^h{x{t)) u{t) = b{^, rj) + a{^, rj) u 

V = + k{^,v)d 



with, moreover y = z\. It clearly appears from the normal form (17.137) that 
the input dehned by the static state feedback 



+v 



which gives 



Zr = V 



(17.138) 

(17.139) 



perfectly decouples the output y = z\ from the disturbance d. 

The decoupling condition is thus simply Eq. (17.136), which can be ex- 
pressed as 



< Lj ^/i, w >= 0 , 1 < i <r 

w{x) belongs to the codistribution in the neighbourhood of x° 

(17.140) 

where the distribution A is equal to 



A = span{m, DLfh, . . . , DUf^h} (17.141) 

The previous decoupling has been realized by state feedback. It is well 
known that when the disturbance can be measured it is advantageous to con- 
sider a feedforward term in the control law, such as 



u{t) = a{x) ^ [3{x)v ^ ^{x) d (17.142) 



hence the closed-loop system equations 

X = f{x) + g{x)[a{x) + !3{x) v] + [w{x) + g{x)^{x)] d 

y = h{x) 



(17.143) 



We can think in a way quite parallel to the decoupling by state feedback. It 
suffices that 

< Ly^h^w gj >= 0 , l<i<r 

[w{x) + g{xy{x)] belongs to the codistribution A^ in the neighbourhood of x° 

(17.144) 
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This condition can be simplified by explaining the Lie bracket 

< Ly^h, w+g^ > = Lyj+g^Vy^h{x) 

= L^L)-^h{x) + LgL)-^h{x)-i{x) = 0 , 

We draw the value of the function 7 

^ ^ L^L}-^h{x) 

l{x) = — Y 7 — 

LgL^ h[x) 

which is to be reinjected in control law (17.142). 

17.2.14 Case of Nonminimum-Phase Systems 

Rigorously, the nonlinear control law (17.128) cannot be applied to nonmi- 
nimum-phase systems, as they are not invertible. Nevertheless, it is possible 
(Slotine and Li, 1991) to apply to them a control law that gives a small tracking 
error or to redefine the output so that the modified zero dynamics is stable. 



1 <i <r 

(17.145) 

(17.146) 



17.2.15 Globally Linearizing Control 

This linearizing control of an input-output type proposed (Kravaris, 1988; 
Kravaris and Chung, 1987; Kravaris and Kantor, 1990a,b; Kravaris and Soroush, 
1990) relies on the same concepts of differential geometry as those which have 
been developed in the previous sections and can be considered as a close vari- 
ant. The formulation of the control law applied to a minimum-phase system of 
relative degree r is very close to Eq. (17.128) and is equal to 



V — U^h{x) — ^h{x) — ... — f 3 r-iLfh{x) — f 3 rh{x) 

LgLy^h{x) 



(17.147) 



which gives the following input-output linear dynamics 

+ /3i + . . . + /3^_i +l3rV = v (17.148) 



In order to guarantee a zero asymptotic error, even in the presence of modelling 
errors and step disturbances (for a PI), the external input can be provided by 
the following controller 

x{t) = f c{t-r) NefM-yM] (17.149) 

Jo 



where the function c(t), for example, can be chosen as the inverse of a given 
transfer function. Frequently, v will be a PI controller (Fig. 17.7), e.g. 



v{t) = K, 



1 

yieiiO - y(0 + - / (yref('^) “ y('^)) 

Xi Jo 



(17.150) 
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In this case, the system stability is conditioned by the roots of the characteristic 
polynomial 



^r+l gV ^ ^ ^ Pr—l s‘^ -\- [/dr + K^) S ~\ — 0 (17.151) 

Tl 

This control has been tested in different forms such as: combination with a 
feedforward controller (Daoutidis and Kravaris, 1989) to take into account 
disturbances, usage of a Smith predictor and a state observer to take into 
account time delays (Kravaris and Wright, 1989), modification to be applied 
to nonminimum-phase second-order systems (Kravaris and Daoutidis, 1990), 
application to a polymerization reactor (Soroush and Kravaris, 1992), with a 
robustness study (Kravaris and Palanki, 1988). 




Figure 17.7: Globally Linearizing Control with PI controller 

Generic model control (Lee, 1993) proposed by Lee and Sullivan (1988) has 
not been formulated by using the concepts of differential geometry. However, 
it can be considered in this framework. Indeed, it is based on a realization of 
the inverse of the model. For this reason, its applicability is limited (Henson 
and Seborg, 1990a), as it is reserved to systems of relative degree equal to 1. 

Certain authors (Bequette, 1991; Henson and Seborg, 1989, 1990b) compare 
globally linearizing control (GLC) and generic model control (GMC) to nonli- 
near control developed by differential geometry, as has been discussed by Isidori 
(1995). We can consider that GLC and GMC represent minor variants with 
respect to the general scope, which has been resumed in the previous sections. 



17.3 Multivariable Nonlinear Control 

Geometric nonlinear control can be extended to multi-input multi-output sys- 
tems; it will be presented only for systems with the same number of inputs and 
outputs. The details of the demonstrations can be found, in particular, in the 
textbook by (Isidori, 1995). Only the main points will be cited here. 

Multivariable nonlinear systems, affine with respect to the inputs, are mo- 
delled as 

X = f{x) +YT=i9i{x)ui 
Vi =hi{x) , i = 



(17.152) 
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with 





Ui 




" yi 




u = 




; y = 




(17.153) 




Um 




ym 





f{x),gi{x), hi{x) are smooth vector fields. In a more compact form, the system 
is modelled by 



X = f{x) + g{x) u 
y = h{x) 



(17.154) 



17.3.1 Relative Degree 

Many reasonings used to define the concepts for multivariable systems are an 
extension of the single-input single-output case. Some of them are specific. In 
the multivariable case, the notion of relative degree is extended by 

— The vector of relative degrees [ri, . . .r^]^ defined in a neighbourhood of 
by 



Lg^Lfhi{x)=0 , Vj = l,...,m , Vfc < r* - 1 
— The matrix A(x) defined by 



Vi = 1, . . . , m 
(17.155) 



A{x) 



- Lg,L^-^h^ix) 



Lg,„^Lj-^hi{x) 

Lg„Uf -^hM 



(17.156) 



which must be nonsingular at 

If we consider a given output of subscript i, we observe that the vector of the 
Lie derivatives verifies 



Lg^Vjhi{x) ... Lg^Vjhi{x) =0 V/c < — 1 



Lg,L^/-^h,{x) ... Lg^L^^-^h,{x) 



^0 



(17.157) 



where is the number of times that the output yi (t) must be differentiated to 
make at least one component of the vector u{t) appear. There exists at least 
one couple (iXj, yi) that have as the relative degree. 

Now consider a system having [ri, . . .r^]^ as the vector of relative degrees. 
The vectors 



Dhi{x°), DLfhi{x°), ..., DLy-^hi{x°) J 

Dh^{x°), DLfh^{x°), ..., DUf-^h^{x°) 



(17.158) 



are linearly independent. 
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17.3.2 Coordinate Change 

The proposed coordinate change is quite parallel to the single-input single- 
output case. Consider a system of relative degree vector [ri, . . Let the 

coordinate change 



(j){{x) 


= hi{x) 


4>^{x) 


= Lfhi{x) 


4>r, 


= V'f-^hiix) 



(17.159) 



Denote by r = ri + . . . + the total relative degree. 

If r < n, it is possible to hnd (n — r) additional functions . . . , (j)n{x) 

such that the mapping 

$(x) = [0}(x), . . . , (x), . . . , fpr+l(x), (j)n{x)] ^ 

(17.160) 

has its nonsingular Jacobian matrix at and thus constitutes a coordinate 
change in a neighbourhood U of 

Moreover, if the distribution 

G = span { 51 ,..., 5 m} (17.161) 

is involutive in [/, the additional functions: . . . , (j)rn{x) can be chosen 

such that 



Lg.(j)i{x) = 0 , r + l<i<n , l<j<m (17.162) 

The condition of nonsingularity of matrix A{x) can be extended to a system 
having more inputs than outputs; it becomes a rank condition: the matrix rank 
must be equal to the number of its rows, or, further, the system must have more 
inputs than outputs, which is classical. 



17.3.3 Normal Form 

The coordinate change gives for (j)\ (similarly for the other (j)^) 
0} = (p\{x) 

=4>l,{x) 

'^n-i = Ljhi{x) + LgjV'f~^hi{x)uj{t) 



(17.163) 
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The normal coordinates are introduced 



^ = with: e = 



" m 




(f>r+l{x) 


Tjfi — r 




. _ 



1 

1 




1 

• 

s 

1 


L ^ri J 







which gives the normal form 



, i = 1, . . . , m 



(17.164) 






C-i = C. (17.165) 

iU = v) Uj 

V = q{^, v) + p{^, i) u 

y = 



with 



(17.166) 



where aij are the coefficients of matrix A{x). 



17.3.4 Zero Dynamics 

The zero dynamics is defined in the same way as for single-input single-output 
systems. The inputs and the initial conditions are sought so that the outputs 
are identically zero in a neighbourhood U of The control vector must be 
such that 

u{t) = -A~^{0,r]{t))b{0,r]{t)) (17.167) 

and the zero dynamics or unforced dynamics is defined by the differential system 

v(t) = v) - v) v) v) , h(0) = V° (17.168) 

The control (17.167) could be expressed in the original state space as 

u*{x) = —A~^{x)b{x) (17.169) 

The change from the unforced dynamics to the forced dynamics would be re- 
alized in the same manner as for single-input single-output systems. 
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17.3.5 Exact Linearization by State Feedback and 
Diffeomorphism 

Consider the system 

X = f{x) g{x) u (17.170) 

without taking into account the outputs. 

We introduce the distributions 



Go 


= span { 51 ,..., 


9m} 


Gi 


= span { 51 ,..., 


9mi adj^i, . . . , adj^TT^} 


Gi 


= span 1 Sid^gi , 


. . . , ad^ 5 m ; 0 < A; < i j 



(17.171) 

The matrix g{x°) is assumed of rank m. The exact linearization is possible 
if and only if: 

— The distribution , (i = 0, . . . , n — 1) has a constant dimension in the 
neighbourhood of 

— The distribution Gn-i has a dimension equal to n. 

— The distribution , (i = 0, . . . , n — 2) is involutive. 

Several facts must be looked at: 

— The non-singularity of matrix A{x) (Eq. 17.156). 

— The total relative degree r must be equal to n. 

— As for single-input single-output systems, the outputs yi are given by the 
solutions Xi{x) , (j = 1 , . . . , m) of the equations 

Lg.L^^\i{x) = ^ , 0 < /c < — 2 , j = 1, . . . , m (17.172) 

The linearizing state feedback is 

u = —A~^{x)h{x) ^ A~^{x)v (17.173) 

and the linearizing normal coordinates are 

Ck{^) — Lj~^hi{x) , l</c<r^,i = l,...,m (17.174) 

17.3.6 Nonlinear Control Perfectly Decoupled by Static 
State Feedback 

The decoupling for the system (17.154) will be perfectly realized when any 
output Vi < i < m) influenced only by the corresponding input Vi. This 
problem has a solution only if the decoupling matrix A{x) is nonsingular at 
i.e. if the system possesses a vector of relative degrees. 

The static state feedback is equal to 



u = —A ^{x)b{x)-\-A ^{x)v 



(17.175) 
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To display the decoupling, it suffices to consider the system in its normal form 
(17.165) and to propose the following control law 

u= (17.176) 

which transforms the system into 

=€2 

C 1 — C 

= bik, v) + J2T=1 (17.177) 

= Vi 

V = q{^, v) + p{^, v) u 

y = 

as we could write 

'ik ' 

= b{^,v)+A{^,ri)u = v (17.178) 

dm 

_ Srm, - 

from Eq. (17.166), giving the coefficients a^j, the control law (17.176) and the 
normal form. 

Two cases can occur: 

— The total relative degree r is lower than n; there then exists an unobserv- 
able part in the system, influenced by the inputs and the states, but not 
influencing the outputs. 

— The total relative degree r is equal to n; the system can be decoupled into 
m chains, each composed of integrators. The system is thus transformed 
into a completely linear and controllable system. 

Of course, we must verify that the decoupling matrix A{x) is nonsingular at 

x° . 

Just as for single-input single-output systems, it is possible to realize a 
pole-placement by adding a state feedback as 

Vi = -4^ - ■ ■ ■ - (17.179) 

The different points treated for single-input single-output systems, asymp- 
totic stability of the zero dynamics, disturbance rejection, reference model 
tracking, are studied in a very similar manner. 

Example 17.1: Multivariable Nonlinear Control of an Evaporator 

(To et ah, 1995) describe nonlinear control of a simulated industrial evapo- 
rator by means of different techniques, including input-output linearization, 
Su-Hunt-Meyer transformation and generic model control. The model of the 
process has three states, two manipulated inputs and two controlled outputs. 
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17.3.7 Obtaining a Relative Degree by Dynamic 
Extension 

Some systems possess no relative degree vector. No static state feedback can 
change this result, as the relative degree property is invariant by this means. 
Suppose that the system 



X = f{x) + g{x) u 

y = h{x) 



(17.180) 



possesses no total relative degree, as the rank of matrix A(x) is smaller than m. 
To obtain the relative degree, a dynamic part is added between the old inputs 
u and the new inputs 'C, modelled by 



u = a{x, Q f3{x, Q V 

C = 7(a;,C) +<^(a;,C)w 

A simple type of dynamics used is the interposition of 
input Vi and the input Ui (Fig. 17.8) modelled in the 
by 

Ui = Cl 

C2 = Vi (17.182) 

Cl = C 2 



(17.181) 

integrators between an 
case of two integrators 



Vi = Cfe 




C2 


[ 


II 




J 





Figure 17.8: Dynamic extension 

The modihed system will be 

X = f{x) + g{x)a{x,C) + g{x) P{x,C)v 
C = 7(a;,C) +<^(a;,C)w (17.183) 

y = h{x) 

The algorithm of dynamic extension described by Isidori (1995) consists 
of progressively increasing the rank of matrix A{x, () corresponding to the 
modihed system until it is equal to m. In particular, this algorithm contains a 
procedure of identihcation of the inputs on which it is necessary to act. 

In summary, if the relative degree is obtained by dynamic extension and if 
the total relative degree of the extended system is equal to n, it is possible to 
assert that the original system can be transformed into a completely linear and 
controllable system by a dynamic state feedback and a coordinate change. 

Example 17.2: Nonlinear Control of a Continuous Polymerization 
Reactor with Dynamic Extension 

Soroush and Kravaris (1994) describe a continuous polymerization reactor where 
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conversion and temperature are controlled by manipulating two coordinated 
flow rates and two coordinated heat inputs. Modeled in this way, the system 
exhibits a singular characteristic matrix. They used a dynamic input-output 
linearizing state feedback by redefining the second manipulated input using 
just the rate of change (thus the derivative) of monomer flow rate instead of 
the original monomer flow rate. This amounts to adding an integrator. 



17.3.8 Nonlinear Adaptive Control 

Frequently, the process model is uncertain or time-varying. In this case, we 
can think of realizing on-line identification of physical parameters influencing 
the model (Kosanovich et ah, 1995). Thus, Wang et al. (1995) studied a batch 
styrene polymerization reactor. At the beginning of the reaction, the poly- 
merization has made little progress and the viscosity of the reactor contents is 
near that of the solvent. When the monomer conversion increases, the viscosity 
strongly increases because of the gel effect, and the heat transfer coefficient de- 
creases significantly because the stirring progressively passes from a turbulent 
regime to a laminar one, creating reactor runaway hazard. The model used 
by (Wang et ah, 1995) describes these phenomena, which have been taken into 
account by using an augmented state vector. Besides the traditional states, 
which are the concentrations and temperatures, the gel effect coefficient and 
the heat transfer coefficient are estimated during the reaction. The used tech- 
nique is inspired from procedures of recursive identification presented by Sastry 
and Bodson (1989). Presently, this research domain is very important Krstic 
et al. (1995). 

17.4 Applications of Nonlinear Geometric 
Control 

Example 17.3: Nonlinear Geometric Control of Chemical and Bio- 
chemical Reactors 

Two single-input single-output applications of nonlinear geometric control per- 
formed in simulation for realistic models are discussed in Chap. 19. The first 
one concerns a continuous stirred tank reactor where a chemical reaction takes 
place. The second one concerns a fed-batch biological reactor. These reac- 
tors present different relative degrees. For each of them, an extended Kalman 
filter is used to estimate the states. From these examples, it should easily 
become apparent to the reader how to apply nonlinear geometric control in a 
real example, at least for SISO systems. An example of pilot application to a 
copolymerization reactor preceded by a study of dynamic optimization in order 
to obtain the optimal profile temperature is given by Centric et al. (1999). 
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Chapter 18 



State Observers 



18.1 Introduction 

In many processes, e.g. in fine chemistry, biotechnology, measurements are 
difficult, needless to say scarce. Only some variables are easily measurable: 
temperature, pressure, flow rate, pH and concentration of some species. Nev- 
ertheless, to optimally control the processes in spite of the absence of adequate 
physical sensors, it is frequently necessary to know the values of variables that 
are not directly available to measurement. 

Thus, in the biological processes of fermentation, the biomass is difficult to 
measure on-line either for reasons of sampling difficulties or potential contam- 
ination problems, but it can be estimated from on-line measurements of pH, 
oxygen and carbon dioxide, and also from the studied fermentation process 
model. The stoichiometric relations, mass balances and possibly heat balances 
will be used. Given the low quality of the measurements, moreover it is gene- 
rally necessary to filter them before use. 

In a distillation column, the objective is to regulate top and bottom mole 
fractions; in the case of nonlinear control of this column, the vapour and liquid 
mole fractions inside the column need to be known; they can be reconstructed 
from the temperature measurements on sensitive trays and from the knowledge 
model of column. Besides the control itself, the prediction of the mole fractions 
profile in the column can allow us to realize not only monitoring for detection 
of misfunctioning, but also failure diagnostic (Li and Olson, 1991). 

An observer, or state estimator, or again intelligent sensor or soft sensor, 
is an algorithm based on the knowledge of models describing the behaviour of 
the process and using measurements acquired in the process to reconstruct the 
missing measurements. The possibilities of observers can be extended to the 
estimation of process characteristic parameters, such as in chemical engineering: 
heat of reaction, global heat-transfer or mass-transfer coefficient and kinetic 
constants. 

The observer is most often designed to estimate the states in a closed-loop 
control system needing the knowledge of the states; it can also simply be used 
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Figure 18 . 1 : Observer principle 



for process monitoring or diagnostics. 

18.1.1 Indirect Sensors 

Many physical sensors are based on a static model such as a mercury ther- 
mometer that uses the relation between quasi-linear dilatation of the mercury 
(observed phenomenon) and temperature, or a pH-meter that uses the poten- 
tial difference between two electrodes and the electrochemistry laws to provide 
an indication of the H+ concentration. These are indirect sensors. 



18.1.2 Observer Principle 

An observer (Luenberger, 1966, 1971; Misawa and Hedrick, 1989), can be de- 
scribed by the diagram in Fig. 18.1. The physical sensor realizes measurements 
in the process. The known control inputs and part or all of the measurements 
constitute the inputs of the estimator which gives an estimation of the states 
(eventually parameters) which are useful for monitoring or control. The esti- 
mator is developed from a linear or nonlinear dynamic model of the process. 
This state estimator can be deterministic such as the Luenberger observer or 
stochastic such as the Kalman filter, depending on whether the process model 
is deterministic or stochastic. 

The most general dynamic model is written in state space as 



X = f{x,u,0,t) 
y =h(x,t) 



(18.1) 



where x represents the state vector and 6 the parameter vector on which the 
model depends. 

The general equation of an observer is resumed as 

new estimated state = old estimated state + 
gain X [ measurement — measurement estimation ] 
which means that a state estimation is realized from the previous state esti- 
mation by adding a correction that is proportional to the deviation between 
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the measurement and its estimation. The objective of an observer consists of 
calculating in an optimal manner the gain that is the proportionality factor, 
while ensuring the estimation stability. 

Another expression is possible: 

derivative of the estimated state = dynamic state model + 
gain X [ measurement — measurement estimation ] 

This second form can be considered as a continuous form (making use of the 
process continuous model) whereas the hrst form is discrete (uses time t — At 
and time t). 

The observer convergence and stability are key-points. Two types of con- 
vergence are theoretically dehned: exponential or asymptotic convergence. The 
quality of estimation will depend amongst other things on the model and mea- 
surement quality. 



18.2 Parameter Estimation 

In many cases of processes, the parameters are ill-known (heat of reaction, 
transfer coefficient, etc.), progressively time- varying (catalyst activity, fouling, 
etc.) or fast (leak in a pressure vessel, choking in a column, etc.). The engineer 
responsible for the process might wish to evaluate them or display a change 
of behaviour. The particular case of biological reactors with the estimation of 
kinetic parameters has been studied (Bastin and Dochain, 1990; Cherny and 
Flans, 1994). In this perspective, it is possible to consider an augmented state 
vector constituted by the model states x and the parameters 6 concerned by the 
estimation procedure. The linear or extended Kalman hlter (Ljung, 1979) will 
be applied to this augmented state vector in an analogous way to the estimation 
of the states alone. There exists a recursive form of the linear Kalman hlter 
used for parameter estimation which needs no matrix inversion; it is based on 
the matrix inversion lemma (Soderstrom and Stoica, 1989). 



18.3 Statistical Estimation 

In existing processes, due to process computers connected to numerous sensors 
with small sampling periods, large amounts of data are collected. For process 
monitoring, fault detection and statistical process control, statistical methods 
such as PCA (principal component analysis) or PLS (partial least squares or 
projection to latent structures) can be implemented to reduce the informa- 
tion so that simple models retain a maximum of information (Burnham et ah, 
1996; Kaspar and Ray, 1992; Kresta et ah, 1991; Rencher, 1995). This type of 
method has been used, in particular, for distillation columns (Kano et ah, 2000; 
Kresta et ah, 1991; Mejdell and Skogestad, 1991a, b) to estimate the product 
compositions from the temperature measurements along the column. 

The objective of these methods is thus to provide a predictive model that 
is capable of explaining the main inhuences of the process measured variables 
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on the product quality variables. The dimension of the model is very much 
reduced compared to the dimension of the concerned data group. 



18.3.1 About the Data 

PCA (principal component analysis) and PLS (partial least squares) are de- 
signed to treat large amounts of correlated data (Otto, 1999; Tenenhaus, 1998). 
Following Kresta et al. (1991), the data are classified in two groups: 

• The process measurements or inputs grouped in a matrix X (size n x k) 
containing k variables, each with n observations. As k is larger than 1, the 
analysis is called multivariate. 

• The product quality variables or outputs (of number m) grouped in a matrix 
Y (size n X m). 

The data X are, in general, transformed into reduced deviation variables. The 
data Y are often only centred. Consider, for example, the matrix X. They are 
first centred around the mean of the calibration set. Then, the centred data 
are scaled to unit variance. 

Finally, the matrix X is transformed into a matrix Z such that 



Z = 



with: Xj = — — — — and: Sj 

n 



\ 



Y^iXkj - Xjf 



k=l 



n — 1 



The covariance matrix for the data matrix X is equal to 
Cov(X) = [Cov(i, j)] 



(18.2) 



with < 



Cov(i, i) = ^ - Xjf i = l,...,p 



n — 1 



k=l 



Cov{i,j) = —^'f2i^ki-Xi){xkj -Xj) i,j = l,...,p ; i^j 



k=l 



(18.3) 

If the data matrix X is centred and scaled to unit variance, the covariance 
matrix of X is equal to the correlation matrix of X defined as 



Corr(X) = [Corr(i, j)] 

Corr(i, i) = 1 i = l,...,p 



Corr(.,j) = 






(18.4) 



where Si (similarly for sj) is the standard deviation defined in (18.2). 



18.3.2 Principal Component Analysis 

In principal component analysis (PCA), data are modelled using orthogonal 
components in order of decreasing importance. The dependent data are thus 
transformed into significant independent ones. 
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According to singular value decomposition (SVD), the matrix X can be 
decomposed into 

X = UT,V'^ (18.5) 

where U is the matrix formed by the normalized eigenvectors of XX^ ordered 
as columns, V is the matrix formed by the normalized eigenvectors of the 
correlation matrix X^X ordered as columns and X) is the diagonal matrix 
having its elements equal to the square roots (singular values of X) of the 
ordered eigenvalues of X^X. 

In PCA, the matrix X is decomposed into 

N 

X = tipj + En (18.6) 

i=l 



where N is the chosen or optimal number of latent vectors, U are the scores 
or principal components associated with the i-th latent vector of X, p- is the 
loading vector for the i-th latent vector of X and is the residual matrix 
of X due to the reduction of dimensionality. X is projected onto a reduced 
A^-dimension subspace by the projection matrix P and takes the coordinates T 
in this subspace. The t^’s and p^’s are forced to be orthogonal, and orthogonal 
to the rows or columns of E so that t^’s are eigenvectors of XX^ and p^’s are 
eigenvectors of as in SVD. It results that SVD and PCA are related by 

P = V and T = UT, if E = 0. Thus, the data contained in X are modelled 
using orthogonal components. As for SVD decomposition where the eigenvalues 
are calculated in decreasing magnitude order, in PCA the eigenvectors are 
calculated in order of decreasing importance. However, SVD is inefficient at 
calculating the principal components for correlated data sets with N k. 
SVD would be equivalent to PCA if the contribution of the singular values in 
X) lower than a given threshold were neglected. The optimal dimension N can 
be obtained by verifying that the addition of new principal components adds 
little information, or it can be done by cross-validation on another set of data 
for which the prediction error sum of squares is calculated. When the latter 
value is sufficiently small, the number N is optimal. Often, a percentage of the 
explained variance is given as 



100 






(18.7) 



After retaining a few principal components (2, 3, . . .), the data are represented 
graphically in the planes formed by two given principal components. 

The matrix Y is similarly decomposed into 



N 

Y = Y,UiqJ + Fn (18.8) 

i=l 

The algorithm describing the iterative principal component analysis called 
NIPALS (nonlinear iterative partial least squares) working on the data X is 
the following: 
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Initially, centre and scale to unit variance the data matrix X. Note Xq = 

X. 

N being the number of principal components, denote by k an iteration step 
with k = 1, . . . , 

1. Set t/g equal to a column of Xk-i, in general the first one. 

2. Calculate the loading vector pj^ = X"^_itk/t^tk 

3. Normalize the loading vector p^ = Pk/WPkW' 

4. Calculate the scores tk = Xk-\Pk/p\Pk' 

5. Compare new and old t vectors: if convergence, go to step 6, else go to step 

2 . 

6. Calculate the residual matrix Ek = Xk = Xk-i — tkP^k • If number of 
principal components is sufficient, end, else go to step 1. 

Finally, the principal component prediction model is 

X=TP^ (18.9) 



where T and P represent the storage matrices for the successive vectors t and 

P- 

After convergence of the sequence of steps 2 to 5, the vectors p and t tend 
respectively towards the eigenvectors of the respective matrices -^^X^ X and 
-^^XX^ . Furthermore, the corresponding eigenvalue is equal to 
Thus, an often used index is the fraction of explained variance, defined as 



/ 



ELi A. 



(18.10) 



where the A^’s are the eigenvalues and N is the number of principal components. 
The numerator considers only the larger eigenvalues retained in the PCA and 
the denominator considers all the eigenvalues of the matrix -^^X^ X . 

Note that the NIPALS algorithm described above can be used with little 
difference in the case of missing data (Tenenhaus, 1998). 



18.3.3 Partial Least Squares 

Partial least squares (PLS) is a method of analysis of data developed in partic- 
ular by Wold et al (Trygg and Wold, 2002; Wold et ah, 1984, 1987, 2001), and 
many reviews are available (Geladi, 1988; Geladi and Kowalski, 1986; Hoskulds- 
son, 1988). The PLS method presents many common points with the PGA 
method but it works simultaneously on the matrices X and Y . The loading 
vectors which were orthogonal in PGA are not orthogonal anymore in PLS. 
The PLS algorithm (Kresta et ah, 1991; Tenenhaus, 1998) is the following: 
Initially, centre and scale to unit variance the data of X, centre (and in 
general scale) the data of Y . Note X^ = X and Yq = Y. 

N being the number of principal components, denote by k an iteration step 
with k = 1, . . . , X. 

1. Set Uk equal to a column of X, in general the first. 
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2. Calculate Wk = X"^_iUk/u^Uk (the columns of X are regressed on u). 

3. Normalize Wk so that \wk\ = 1. 

4. Calculate the scores tk = Xk-iWk/w'^Wk- 

5. Calculate = t^Y /t^tk (the columns of Y are regressed on t). 

6. Calculate new vector Uk = Y q^/ (^q^. 

7. Compare new and old u vectors: if convergence, go to step 8, else go to 
step 2. 

8. Calculate the X loadings = X^^tk/t^tk- 

9. Calculate the residuals Xk = Xk-i — tkPk Xk = Yk-i — tkqk • 
Increment k hj 1. If k < N, go to step 1, else end. 

This algorithm is a modihed version of the original PLS algorithm, where in 
steps 1, 5, 6, 8, the matrices Xk-i and Yk-i are used instead of respectively 
X and Y as here. 

The predicted response matrix by the PLS model is = T ^ where T 
and Q represent the storage matrices for the successive vectors tk and e.g. 

T=[ti t2 ... In] (18.11) 

For the whole matrix of data X, the corresponding predicted matrix can 
be given as 



N 

Y = ^tkql=TQ^ with: T = X W{P^ W)~^ (18.12) 

k=l 

where P and W are respectively the storage matrices for the successive vectors 
Pk and Wk. 

The regression model with respect to centred and scaled data gives the row 
vector of predicted variables 

y = xW{P^W)-^Q^ (18.13) 

where x is the analytical row vector of components xi, . . . , Xn^. 

Algorithms exist for nonlinear PLS (Hassel et ah, 2002) and dynamic PLS 
(Lakshminarayanan et ah, 1997). 



18.4 Observers 

18.4.1 Luenberger Observer 

Consider a system described by the linear deterministic model 

x{t) = Ax{t) -\- B u{t) C8 14^ 

v(t) =C*(t) 

A Luenberger observer (Luenberger, 1966, 1971, 1979), is described by the 
following dynamic system 

z = A x{t) + B u{t) + G (y{t) — C x{t)) 

= {A-GC)x{t)AB u(t) + G y(t) 



(18.15) 
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The dynamic response is determined by the matrix A — GC whose eigenvalues 
can be fixed (provided the system is observable) by conveniently choosing the 
matrix G. 

18.4.2 Linear Kalman Filter 

The linear Kalman filter (Kalman, 1960; Kalman and Bucy, 1961) has been 
described in Chap. 11. This optimal filter is a method of stochastic estimation 
that calculates the gain matrix by minimization of the sum of the estimation 
error variances (Bozzo, 1983). 

In general, two types of linear Kalman filters are distinguished: the con- 
tinuous-discrete Kalman filter where the model is continuous and the mea- 
surements are discrete, the discrete-discrete Kalman filter where the model is 
discrete and the measurements are discrete. 

An implementation of the discrete linear Kalman filter (Brown and Hwang, 
1997) easily usable for state estimation is proposed in Eqs. (11.114) to (11.116). 

In Gaussian linear quadratic control (Sect. 14.6.2) that uses the linear 
Kalman filter to estimate the states, the separation principle based on the 
closed-loop model eigenvalues was shown: it is possible to separately deter- 
mine the observer and the state feedback optimal control law. Similarly, the 
separation principle is applicable to Luenberger observer. 

18.4.3 Extended Kalman Filter (EKF) in 
Continuous-Discrete Form 

The extended Kalman filter (EKF) is an extension of the linear Kalman filter 
to the case where the system is described in state space in a nonlinear form. 
The derivative of the state vector is equal to 

X = f{x,u, O^t) w (18.16) 

where it; is a Gaussian noise of zero mean and covariance matrix Q. The 
measured outputs z are described by the model 

Zk = h{xk) + Vk (18.17) 

where Vk is a Gaussian noise of zero mean and covariance matrix Rk. 

In this way, EKF is defined in the continuous-discrete form, i.e. the process 
is continuous and the outputs are discrete as they are measured at sampling 
times denoted by /c, which are in general regularly, but not necessarily, spaced. 

In this first approach, the model parameters 6 are assumed to be perfectly 
known; the model can thus be written in simplified form 

x = f{x^u^t)Aw (18.18) 

The problem is to estimate at time k the unmeasured states by using the 
available process measurements at time k — 1. The recurrent algorithm of the 
filter includes two stages: 




Process Control 



665 



1. Propagation of the state estimation and the error covariance matrix. 

This means that the differential equations describing the variation of the state 
vector X and the error covariance matrix P are integrated in the time interval 
[k — l^k] to obtain a prediction, whereas the measurement has not yet been 
realized. The predictions of x{tk) and P{tk) are respectively denoted by Xk{—) 
and Pk{—). The differential equations to be integrated are 



x{t) = f{x^u^t) ; :c(0) = a:^o (18.19) 



P{t)=F{x,t)P{t) + P{t)F\x,t)+Q{t) ; P(0)=Po (18.20) 

where F is the Jacobian matrix of / equal to 



F{x,t) 



dl 

dx 



x=x 



(18.21) 



2. Update of the state estimation and the error covariance matrix. 

The measurements realized at instant tk are used to correct the estimations 
Xk{—) and Pk{—) by minimizing the estimation error. The estimations thus 
corrected, obtained at instant are denoted by Xk{-\~) and Pk{-\~) and are 
equal to 

Xk{+) = Xk{~) + Kk [Zk - h{xk{-))] (18.22) 

Pfe(+) = [I-Kk Hk{xk{-)) ] Pfe(-) (18.23) 

where Hk is the Jacobian matrix of h equal to 






dh 

dx 



X=Xk{~) 



(18.24) 



and Kk is the Kalman gain matrix equal to 



1 -1 



Kk = Pk{-) Hi {xkH) HkixkH) PkH Hi {xkH) + Rk (18.25) 



Very frequently, the noise covariance matrices Q and Rk that represent the 
measurement of the model and measurement uncertainty are assumed to be 
diagonal. Their adjustment can be done by simulation. 

The Kalman gain K can affect in different ways the various estimated states. 
Indeed, the elements of the Jacobian matrix F represent, through the model, 
the sensitivity of the different states one to each other. 

Because of its similarity with the linear Kalman filter, the extended Kalman 
filter (Fig. 18.2) is often used although it presents some drawbacks: 

— P being only an approximation of the true covariance matrix, the extended 
Kalman filter performance cannot be guaranteed and its stability cannot be 
proved. 

— The extended Kalman filter equations assume that the process model is 
exact. No robustness is guaranteed against modelling errors. 
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Figure 18 . 2 : Implementation of the extended Kalman filter in a control 
scheme 



For these reasons, variants have been designed (Misawa and Hedrick, 1989) 
such as the constant gain extended Kalman filter (Safonov and Athans, 1978) 
which was essentially proposed to avoid the long calculations related to the 
update of the state and covariance matrix estimations. Becerra et al. (2001) 
have applied the extended Kalman filter to systems described by differential- 
algebraic models. 

Industrial experiences related to the use of extended Kalman filter are cited, 
in particular, in the article of (Wilson et ah, 1998) where general recommenda- 
tions are given and cautions are presented for industrial applications concerning 
difficulties that scarcely appear in simulation. 



18.4.4 Globally Linearizing Observer 

The development of the globally linearizing observer (Isidori, 1995; Krener and 
Isidori, 1983) can be considered dually by comparison with the exact state- 
space linearization which allowed us to build a state feedback with prescribed 
eigenvalues. In the case of the observer synthesis, we are looking for an ob- 
servation error dynamics which becomes, after coordinate change, linear and 
whose eigenvalues can be prescribed. 

Consider the nonlinear system 



X = f{x) 

y = h{x) 



(18.26) 



We seek a diffeomorphism z = ^(x) transforming the system (18.26) into a 
linear system 



z 

y 



/(®) 



dx 



x=^-^(z) 



Az -\- k{y) = Az A k{C z) 
Cz 



(18.27) 



where the pair (A, C) is observable and A; is a function vector. 
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In these conditions, the observer is given by 



^ =A^ + k{C^) + G{C^-y) 
= {A + GC)^ + k{y)-Gy 



(18.28) 



The observation error, i.e. the difference between the theoretical and estimated 
state, is equal to 

e = ^ - z = ^ - ^{x) (18.29) 

and its dynamics is described by the linear system 

e = {A^GC)e (18.30) 



whose eigenvalues can be prescribed through the gain vector G. 

The observer linearization is possible if and only if 

dim(span |d/i(a:^°), dLj^h{x °)^ . . . , dL^~^h{x°)^) = n 

We then seek the unique vector r in the neighbourhood U of x^ 

Lrh{x) = LrLfh{x) = . . . = LrLy‘^h{x) = 0 
LrLy^h{x) = 1 

then the mapping F such that its Jacobian matrix is 

B F 

— = [t{x) -adfT{x) ... (-l)”“^ady 

providing a system of n partial differential equations. Then we set 

$ = F~^ 



(18.31) 
such that 



(18.32) 



(18.33) 



(18.34) 



and we use 



^(-^n) 



dx 



/(®) 



x=^-^(z) 



0 



Zn-l 



(18.35) 



The implementation of such an observer is, of course, not very simple and 
the following high-gain observer is easier to implement. 



18.4.5 High-Gain Observer 

The high-gain observer (Gauthier et ah, 1992) is a kind of extended Luenberger 
observer. Consider the nonlinear system 



X =f{x)Ag{x)u 

y = h{x) 



(18.36) 
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The mapping ^ defined by 






h{x) 

Lfh{x) 

L^-^h{x) 



(18.37) 



constitutes a diffeomorphism that transforms the system (18.36) into 



X 



y 



±1 




' X2 




±2 




X3 






= 




+ 


^n—1 




Xn 




_ Xn 




_ _ 





= xi = C X 



gi{xi) 

92{xi,X2) 



Qnixx^ ■ ■ ■ 5 ^n) 



The system is assumed to be uniformly observable. 
The following dynamic system 



u = F{x) + G{x) u 



(18.38) 



X = F{x) + G{x) u — {C X — y) (18.39) 

constitutes an observer for the system (18.36). The positive definite matrix 
Soo is the solution of the equation 



-0Soo-A^Soo-SooA + C'^C' = O (18.40) 



where is a large scalar that allows us to adjust the convergence rate of the 
estimator, with C = [1 0 ... 0] and A is the matrix 



r 0 1 0 ...0 1 



A = 



0 



•. 0 

0 1 

... 0 



(18.41) 



The observer thus defined converges exponentially. It has been applied in sim- 
ulation in biological reactors (Gauthier et ah, 1992). 

A slightly different version of the observer, where the matrix Soo is replaced 
by matrix S calculated as the solution of the differential equation 

S=-eS-A^S-SA + C'^C (18.42) 

has also been used in simulation in chemical reactors (Gibon-Fargeot et al., 
1994). 
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18.4.6 Moving state estimation 

The moving state observer is an efficient means of estimating the states, with 
in particular the possibility to constrain the states, ouputs and noises. It can 
be grossly described as a least-squares optimization leading to an estimation 
of the states and working with a limited amount of information. It avoids 
the recursive manner characteristic of the extended Kalman filter. It possesses 
common characters with model predictive control (chapter 16). It has been 
studied by several researchers (Alamir, 1999; Kwon et ah, 1999; Michalska and 
Mayne, 1995; Rao et ah, 2001; Robertson et ah, 1996; Slotine et ah, 1987; Wang 
et ah, 1997; Zimmer, 1994) under different approaches, however presenting some 
similarities. 

The full and moving state estimations follow more or less the same steps. 
The main difference of the moving state estimation with respect to the full 
state estimation resides in the handling of the variables. In the full state es- 
timation, at current time /c, all variables (manipulated inputs, measured out- 
puts, estimated states) from initial time i = {) to i = k are necessary and used 
in the calculation. In the moving state estimation with horizon i7, only the 
concerned variables (manipulated inputs, measured outputs, estimated states) 
from i = k — H to i = k are necessary and used in the calculation, they are 
collected in moving vectors. 

First consider the problem of full state estimation. Assume that the process 
is represented by the following continuous-time model similar to the model used 
by the extended Kalman filter 

^(0 = u{t)) + Gw{t) (18.43) 

where u; is a Gaussian noise of zero mean. The measured outputs y are de- 
scribed by the discrete-time model 

y^ = h(xk) +Vk (18.44) 

where 1 ;/^ is a Gaussian noise of zero mean. 

The continuous nonlinear model (18.43) is approximated by the linear dis- 
crete model 

Xk+i = Axk^ Buk^Gwk (18.45) 

where A and B are the Jacobian matrices with respect to Xk and Uk respecti- 
vely. The measurement model is linearized as 

Vk+i = C Xk+i + Vk+i (18.46) 

where C is the Jacobian matrix of h with respect to Xk. 

In the full state estimation problem, the following criterion 

k-l 

Jk = (a:o - xof Ho ^ (a;o - «o) + R~^ Vj+i + wj Q~^ Wi) (18.47) 

^=0 

is minimized with respect to the initial state Xo and to the sequence of noises 
{wo, . . . , Wk-i}- Then the states Xi are obtained by use of equation (18.45). 
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The weighting matrices R~^ and 11“^ respectively symbolize the confi- 

dence in the dynamic model, the measurements and the initial estimation. 

For example, at time k = 1 when the first optimization problem is solved 
in full state estimation, the criterion Ji 

Ji = (a;o - xo)'^ Ho ^ (a;o - xq) + R~^ vi + Wq Q~^ wq) (18.48) 

is minimized with respect to Xq and Wq based on the available measurement 
Hi related to the relation 



vi = - C xi = - C {Axo ^ B uo AGwq) (18.49) 

The optimized variables being denoted as Xq and Wq, this yields the predicted 
state 

xi = AxqA Buo^ Gwq (18.50) 

A drawback of full state estimation is that the size of the optimization 
problem grows as time increases, which would very likely induce a failure in 
the optimization or pose problems in real-time control. The logical solution 
to this increasing size is to set the problem according to the moving-horizon 
approach. 

Now consider the problem of moving state estimation. The criterion (18.47) 
is split into two parts (Rao et ah, 2001; Robertson et ah, 1996) 

k-l 

Jk = Jk-H+ Y. {■vJ+iR~^Vi+i+wjQ~^Wi) = + (18.51) 

i=k-H 

Clearly, the second part of the criterion (18.51) depends on the state 

Xk-H and on the sequence of noises {wk-H^ • • • ^'^k-i}- The parallel with 
dynamic optimization (section 14.5) appears in this splitting as Jk-H itself 
must be optimized. If /c < i7, the problem is a full state estimation problem. 
Assume that k > H and set the optimized criterion 



Jk-H = min Jk-H 

Xo,Wo,...,Wk-H-l 



SO that, in the full optimized criterion becomes 



(18.52) 



T* — 
aju — 



mm 

Xo,Wo,...,Wk-i 



Jk 



mm 

Z,Wk-H ,---,Wk~i 



k-l 






li=k-H 



+ Jk-ni^) 



(18.53) 

where z is the arrival state Xk-H based on the optimized variables Xq and 
i'^k-H^ • • '^k-H-l}' 

In actual practice, it is not possible to really minimize Jk-ni^) when k 
becomes large as this would be again a full estimation problem. The solution 
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is to retain the previous values of the optimized criterion obtained by mov- 
ing horizon estimation denoted by along time k and to approximate 

Jk-H{z) as 



Jk-H{z) ^ (z - ^k-H “ ^T-h) + (18.54) 

where is the state estimated by moving horizon estimation at time {k — 

H). Under these hypotheses, the criterion (18.51) becomes 



k-l 

i=k-H 

(18.55) 

The discrete Riccati equation (11.92) used for the covariance matrix of the 
Kalman hlter is called to update Ilk 

Uk = A Uk-i ^GQG^ -A Uk-i [C Uk-i ^ R]~^C A^ 

with: IIo given 

(18.56) 

Algorithm of the moving state estimation: 

1 / Initialization of the states, of the matrices of the criterion. 

2/ Get the measurements y{k). Store the previous manipulated inputs and 
measured outputs. 

3/ Compare the current time index k to the value of the moving horizon H. 
If k < perform full state estimation. If k > i7, perform moving state 
estimation. 

4/ State estimation: 

4a/ Call the initial parameters for nonlinear optimization. 

4b/ Perform the nonlinear optimization. 

4c/ Calculate the new estimed states x. 

4d/ Update the covariance matrix tt. 

4e/ Store the estimated states (and the optimal value of the criterion). 

4f/ Go back to stage 2. 

Example 18.1: Moving horizon state estimation for a biological pro- 
cess 

The studied biological process is a fed-batch bioreactor where three states 
are considered: biomass (X), substrate (S) and product (P). Only the sub- 
strate is measured. The biomass cannot be measured on-line without noticeable 
delay and the product is in general only measurable with a rather important 
delay. 

The model of this biological process is formulated under nonlinear continu- 
ous state-space form and comes from mass balances and equations of biological 




Product concentration (g/1) Biomass concentration (g/1) Substrate concentration (g/1) 




Figure 18.3: Moving state estimation of substrate 




Figure 18.4: Moving state estimation of biomass 




Figure 18.5: Moving state estimation of product 

18.5 Conclusion 



State and parameter observers are capable of providing a significai 
operators for process monitoring and diagnostic. In the case of line; 
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Part VI 

Applications to Processes 




Chapter 19 



Nonlinear Control of 
Reactors with State 
Estimation 

19.1 Introduction 

Many linear and nonlinear control methods exist. Very often, the choice of a 
particular method depends on the theoretical knowledge of the person respon- 
sible for the development of the control system. This choice also depends on 
the characteristics of the studied system. Linear control such as pole-placement 
(Sect. 13.1), internal model control (Sect. 13.2) and generalized predictive con- 
trol (Sect. 15.6), have been previously studied using an identihed linear model 
(Sect. 12.7.2) of the chemical reactor described in Sect. 19.2. 

Exothermic chemical reactors, especially polymerization reactors, present 
several stationary points and must frequently be controlled around an unstable 
point. Their characteristics are highly nonlinear, particularly for batch reactors. 

Biological reactors are most often operated in batch or semi-batch mode, 
especially to avoid contamination problems. Their behaviour can only be de- 
scribed by complex kinetic models that are able to represent them in a large 
operating domain. 

For the reasons previously cited, nonlinear geometric control is well adapted 
for such systems. It allows us to fully exploit the knowledge that the engineer 
has about the process. However, it necessitates us knowing the system states, 
and a state observer needs to be coupled. 



19.2 Chemical Reactor 

Examples of nonlinear geometric control can be found for an exothermic re- 
actor (Limqueco and Kantor, 1990), for control of batch reactors (Kravaris 
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et ah, 1989; Soroush and Kravaris, 1992), with feedforward- feedback (Daoutidis 
et ah, 1990), for a multivariable continuous polymerization reactor (Soroush 
and Kravaris, 1993, 1994). 

The example retained here is classical with regard to the mass and heat 
balances describing the dynamic behaviour of a chemical reactor. It concerns 
a continuous reactor in which a single reaction occurs: A — > B . The nonli- 
near state-space model is composed of three differential equations describing 
the evolution of the three states {Ca concentration in A, T reactor tempera- 
ture, Tj jacket temperature), among which only one is measured (the reactor 
temperature). 

19.2.1 Model of the Chemical Reactor 




Figure 19.1: Scheme of the chemical reactor with jacket and heat exchanger 
system acting on the temperature entering the jacket 

The continuous chemical reactor (inlet with subscript “0”, outlet having the 
properties of the reactor contents) is assumed to be perfectly stirred. The 
n-order reaction A — > B releases a heat of reaction AH. The reactor (Fig. 
19.1) is surrounded by a jacket (subscript “/’) of constant volume Vj crossed by 
the heat-conducting ffuid circulating at a constant ffow rate Fj with a variable 
inlet temperature Tj^i^. Corresponding to a lab pilot reactor, the inlet jacket 
stream results from the mixing of two streams, one running through a cold 
heat exchanger (temperature Tc), the other one running through a hot heat 
exchanger (temperature Th). The proportion of ffuid through these exchangers 
depends on the position n of a three-way valve. Thus, the inlet jacket tempe- 
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rature is grossly a weighted function of both heat exchanger temperatures. A 
proportional feedback controller regulates the liquid level in the reactor around 
a volume set point Vc (Luyben, 1990). The features of this reactor for the 
heat exchanger design are very close to the pilot reactor used by Centric et al. 
(1999). 

The equations ruling this reactor are the following: 

• Inlet temperature in the jacket 



TjAn = uTh + {1 - u)Tc 
• Mass balance of the reactor 


(19.1) 


dV/dt = Fo-F 


(19.2) 



• Balance in component A 

d(CA)/dt = Fo {Cao - Ca)/V -kCl (19.3) 

• Energy balance for the reactor 

dT/dt = Fo/V (To -T)-AHk C\/{p C^)-U A(T- Tj)l{V p Cp) (19.4) 

• Energy balance for the jacket 

d(Tj)/dt = Fj/Vj {TjAn -Tj)FUA{T- Tj)!{Vj pj Cpj) (19.5) 

• Proportional level control in the reactor 

F -F^ = SF = -K^ (Vc - V) (19.6) 

• Kinetic constant and activation energy 

k = ko exp(— T/TT) (19.7) 

To simplify, we assume that the reaction order n is equal to 1. 

The reactor characteristics are given in Table 19.1. 



19.2.2 Control Problem Setting 

The control aim is to make the reactor contents follow a temperature prohle. 
Erom the previous differential equations, the state vector is 

X = [xi = Ca ] X2 = T ] Xs = Tj ] = V (19.8) 



The state-space model is 



Xi 



X2 

Xs 

X4 



Fq {Cao — xi) 

X4 

Tq (Tq — X 2 ) 

./• I 

Fj {Tj,in — Xs) 

Vj 

Fo-F 



AiJfcoexp(--^^)xi 

P 

U A{x2- X3) 

Vj pj Cpj 



U A {x2~ X3) 

X4pCp 



(19.9) 
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Table 19.1: Initial variables and main parameters of the CSTR 



Flow rate of the feed 


Fo = 3 X 10 ^ m^.s ^ 


Concentration of reactant A in the feed 


Cao = 3900 mol.m“^ 


Temperature of the feed 


To = 295 K 


Volume of reactor 


V = 1.5 m^ 


Kinetic constant 


ko = 2x lO^.s-i 


Activation energy 


E = 7 X 10^ J.moD^ 


Heat of reaction 


AH =-7x lO"* J.mol-i 


Density of reactor contents 


p = 1000 kg.m“^ 


Heat capacity of reactor contents 


Cp = 3000 J.kg-I.K-I 


Temperature of the cold heat exchanger 


Tc = 280 K 


Temperature of hot heat exchanger 


Th = 360 K 


Flow rate of the heat-conducting fluid 


Fj = 5 X 10“^ m^.s“^ 


Volume of jacket 


Vj =0.1 m^ 


Heat transfer coefficient between the jacket 




and the reactor contents 


U = 900 W.m-2.K-i 


Heat-exchange area 


A = 20 m^ 


Density of the heat-conducting fluid 


Pj = 1000 kg.m“^ 


Heat capacity of the heat-conducting fluid 


Cpj = 4200 J.kg-bK-i 


Proportional gain of the level controller 


= 0.05 s-i 



y = X 2 (19.10) 

To maintain the reactor around its equilibrium state, a nonlinear control 
based on differential geometry is used. It necessitates us knowing the states, 
but two of these are not measured. Thus, an observer or state estimator must 
be implemented. In the present case, we will use the extended Kalman filter 
(Watanabe, 1992) as the reactor dynamic model is nonlinear. If this model 
were linear, the linear Kalman filter would have been applied. On the other 
hand, the knowledge of the concentration in the reactor based only on the mea- 
surement of the temperature can be important information for the operators 
running the process. Notice that a minimum model of the process is necessary, 
and that the better the model, the better will be the estimation for a given mea- 
surement quality. In the same arena of ideas, the estimation of more complex 
variables concerning polymerization reactions such as the polydispersity and 
the molecular weight distribution moments is possible (Centric et ah, 1999). 



19.2.3 Control Law 

By replacing the inlet jacket temperature Tj^in by its expression with respect 
to the position u of the valve, the dynamic model (19.9) of the chemical reactor 
is expressed as a nonlinear system, single-input single-output, and affine with 
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respect to the control input, as 



/(x) +g{x)u 



with the following vector helds 
^0 {Cao — xi) 



fi^) = 



y = h{x) 



E 



X 4 



- fcoexp (-— — )xi 



Rxo 



E 



F, (r, - .T2) — 



)xi 



X4 pCp 

Fj (Tf - X 3 ) U A {X 2 - X 3 ) 



U A{x2~ X 3 ) 
X 4 pCp 



Fn-F 



Vj pj c, 



PJ 



(19.11) 



(19.12) 



and the scalar output 



g{x) 



0 

0 



Fj (Te - Tf) 

V, 



h{x) = X 2 



(19.13) 



(19.14) 



First, the relative degree of this system is to be determined. For that, the 
following Lie derivatives are calculated: 

— The Lie derivative of h{x) in the direction of the vector held / 



dh 

Ffh{x) =—f{x)=f2{x) 

Fo X AiJ , , , UA , 

= — (To - X 2 ) 7 ;- ko exp{-E/Rx 2 ) xi — [X 2 

X4 pCp x^pCp 

— The Lie derivative of h{x) in the direction of the vector held g 

LgL^jh{x) = Lgh{x) = 0 

which implies that the relative degree is larger than 1. 

— The Lie derivative of Lfh{x) in the direction of the vectors held g 

^ ^ 7 / N dLfh(x) , , U A ^ ^ , 

9{x) = ^jl^j (^c — Tf) 



-X 3 ) 

(19.15) 

(19.16) 



(19.17) 



This derivative is always different from zero, as the characteristic tempera- 
tures of the cold and hot heat exchangers are different. The relative degree 
of the output h{x) = X 2 of the nonlinear system (19.9) is thus equal to 2, 
i.e. it is necessary to twice differentiate the temperature to make the input 
appear linearly. 
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Figure 19.2: Nonlinear geometric control of the chemical reactor with pole- 
placement 



In order to eliminate the stationary error, a PI controller is introduced so that 
the external input is equal to 



v{t) = Kc 



(yr(t) -y(t)) + 



Tl 



f 



(yr(T) -y{T))dT 



(19.18) 



where yr is the set point and y the controlled output. The complete system is 
represented by the block diagram in Fig. 19.2. 

The general form of the state feedback control law that linearizes the input- 
output behaviour of the closed-loop system (Isidori, 1995) results 



V — coh{x) — c\Lfh{x) — L‘jh{x) 
LgLfh{x) 



(19.19) 



where cq,ci are constant scalar parameters. The Lie derivative L^h{x) is only 
presented in an abbreviated form, which can be easily expressed 



Ljh{x) = J2^Jiix) 

i 



(19.20) 



Of course, it is necessary to modify Eq. (19.19) so as to take into account the 
reference trajectory as in Eq. (17.133). 

Then, it is sufficient to determine the parameters cq^ci^ Kc^ri so that the 
closed-loop transfer function equal to 

Tii , , 19 . 21 ) 

T^(5) 5^ + Ci5^ + (co + Kc)s + Kc/ti 

presents the desired characteristics, and thus that the poles of the following 
characteristic equation 

T Cl 5^ T (cq T Kq^s T Kq^tj = 0 (19.22) 



are correctly located in the complex plane; this corresponds to a pole-placement. 
Eor this, it is possible to use an optimal continuous polynomial in the sense of 
the ITAE criterion (Table 4.1). 
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19.2.4 State Estimation 

Only the temperature and volume of the reactor contents are assumed to be 
measured, which is often the case in practice. The extended Kalman hlter was 
used to estimate the three states: concentration reactor temperature T 
and jacket temperature Tj. To the theoretical reactor temperature given by 
the simulation model was added Gaussian white noise of standard deviation 
0.1 to simulate the measurement errors. 

The algorithm of the extended Kalman hlter (Sect. 18.4.3) consists of two 
stages: a stage of state and error covariance matrix prediction and a stage 
of correction of the predicted variables. In the prediction stage, the system of 
differential equations describing the derivatives of the states and the derivatives 
of the elements of the covariance matrix is integrated. The correction stage 
modifies the predicted values by adding a term such as 

K{t) 

[^measure d(0 ^predicted (^)] (19.23) 

where K (t) is the Kalman gain matrix calculated from the predicted values and 
y{t) a measured output of the process. 

By adopting the general notations of Eqs. (18.16) and (18.17), the vector 
h defining the output is equal to 

h = [0 ; X2 ; 0 ; 0] (19.24) 

We consider that the level is perfectly known and that the reactor volume is 
not estimated, so that djdx/^ = 0. We calculate the Jacobian matrix (18.21). 
To facilitate the writing, we use the Kronecker symbol: = 1 if i j = 

0 \ii 7^ j. 

The partial derivatives {i = 1, 2, 3, 4) are then equal to 

Fu 
F2^ 



Fu 

a) Realization of the propagation stage: 

The error covariance matrix P is initialized as a scalar matrix of elements 

Pa = {Xi/20 f; Pij = 0 if: i ^ j (19.26) 

while the state noise covariance matrix Q is also scalar and equal to 

- 4 . 



d[fi + 9i u) 
dxi 

d{f2 + 92 u) 
dxj 



d{h + g3 u) 

dxi 

d{f4 + 94 u) 
dxi 



Xi Rx2 

02i ^ k On 

X4 pCp 

AH E „ , UA 



hikxi - 



pCp Rx2 
Pi , UA 

-TThiP 



V, 



Vi Pi a. 



X4pCp 
(<^2i — <^3i) 



(<^2* - hi) (19.25) 



P3 



= 0 



Qii = 10 



Qii = 0 if: i 7^ j 



(19.27) 
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Equation (18.19) is the system constituted by the first four differential equations 
of model (19.9) after having replaced x hj x. 

Equation (18.20) is a system of 4x4 differential equations obtained after having 
replaced F, the Jacobian matrix calculated by Eq. (19.25) and Q previously 
initialized. 

Thus, we integrate numerically this system of (4+9) differential equations 
on a sampling period. We then obtain Xk{—) and Pk{—) • 
b) Realization of the update stage: 

The Jacobian matrix H of /i, which is, in fact, a scalar function, is to be 
calculated. Thus, iT is a vector and its only nonzero element is H 12 = 1. 

The noise Rk in the measurement is a diagonal matrix. In fact, only the 
temperature is measured, thus Rk has dimension 1: R 22 = = 10“^. 

The Kalman gain matrix iT/j, is calculated from Eq. (18.25), then the state 
estimations from Eq. (18.22) 

Ca = ^i(+) 

I = X2{-\~) = 

fj=Xs{^) = 

Vj = Vj 



Xi{-) +iCi2 [X2,: 

X2(-) +F 22 [X2,: 
Xs{-) +iC32 [X2,: 



-X2(-) 

-X2(-) 

-X2(-) 



(19.28) 



We also update the covariance matrix Pk{—) by means of Eq. (18.23) to 
obtain Pk{-\~). 

Then we can restart at stage a). 




Xime (s) 



Figure 19.3: Open-loop response to a pseudo-random binary sequence: tem- 
peratures of the reactor and the jacket 



19.2.5 Simulation Results 

Two types of simulation are realized: in the absence and in the presence of 
chemical reaction. In the first case, only the phenomena of pure heat transfer 
occur. When the chemical reaction intervenes with its heat of reaction, the 
temperature control is influenced. Eor a batch reactor, the reactants are mixed 
only when the desired temperature is obtained and, at that instant, an impor- 
tant heat of reaction is developed in the reactor where a thermal runaway can 
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Figure 19.4: Nonlinear geometric control in the absence of chemical reaction. 
Top: temperatures of the reactor and of the jacket and temperature set point. 
Bottom: valve position 



occur if the temperature increase is not controlled. For a continuous reactor, in 
nominal operation, the heat of reaction is a well mastered normal phenomenon. 

Here, the characteristics of a continuous reactor have been modihed to re- 
semble those of a batch reactor: at the initial time, the reactor is assumed not 
to be at steady state and there is no chemical reaction. It can be compared 
to a continuous chemical reactor at the startup. After heating to the desired 
temperature, the chemical reaction is operated and the heat of reaction thus 
evolved must be controlled. In this case, in terms of control, this heat of reac- 
tion is a disturbance. For a continuous reactor operating normally, the heat of 
reaction would be identihed with the rest of the heat transfer phenomena. 

A hrst open-loop simulation in the absence of chemical reaction in response 
to a pseudo-random binary sequence applied to the valve position allows us 
to evaluate the reactor time constant related to the heat transfer essentially 
between the jacket and the reactor contents. The behaviour (Fig. 19.3) is close 
to that of a hrst-order linear system of time constant around r = 250 s. 

The closed-loop simulation was realized in the following conditions: the 
sampling period is chosen to be equal to 5 s. The control input u varies in the 
interval [0, 1], and is modihed only every 20 s. The parameters of the control 
law are chosen so that the closed-loop linearized system is optimal for the 
ITAE criterion: = 6.8 x 10-^r/ = 4.28, cq = 0.0013, ci = 0.044. The 

integral term of PI control is only active when the deviation between set point 
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Figure 19.5: Nonlinear geometric control with chemical reaction starting at 
t = 1800 s. Top: temperatures of the reactor and of the jacket and temperature 
set point. Bottom: valve position 




Figure 19.6: Nonlinear geometric control with chemical reaction starting at 
t = 1800 s: theoretical, measured and estimated temperatures of the reactor 



and measurement is lower than 2 K, in order to avoid the windup of the PI 
controller. The set point is always filtered by a second-order filter to avoid too 
brutal variations of the control input. Note that this reactor is not considered 
at steady-state at t = 0 and that in all closed-loop figures, the chemical reaction 
starts only at t = 1800 s. 

The initial theoretical states and the estimated initial states are given in 
Table 19.2. In this case, we do not consider possible changes in the feed flow 
rate and reactor volume. 
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Figure 19.7: Nonlinear geometric control with chemical reaction starting at 
t = 1800 s: theoretical and estimated jacket temperatures; temperature set 
point 




Figure 19.8: Nonlinear geometric control with chemical reaction starting at 
t = 1800 s: theoretical and estimated concentrations 



A hrst closed-loop simulation is performed in the absence of chemical re- 
action (Fig. 19.4). The desired prohle is followed without difficulty and the 
corresponding control input varies slowly. 

In the second closed-loop simulation, the reaction starts at t = 1800 s, when 
the desired temperature level is reached. At this time, the highly exothermic 
reaction induces a noticeable temperature overshoot (Fig. 19.5). The return 
towards the stationary is realized with a few oscillations. It would be possible 



Table 19.2: Initial theoretical states and initial estimated states of the CSTR 



Initial state 


Theoretical 


Estimated 


Concentration Ca 
Reactor contents temperature T 
Jacket temperature Tj 
Reactor volume V 


3900 mol/m^ 

320.5 K 
325.0 K 

1.5 m^ 


3300 mol/m^ 
320.0 K 
320.0 K 
1.5 m^ 
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to limit the oscillations by modifying the controller parameters with regard to 
those obtained from the ITAE criterion. The control input still varies relatively 
slowly in spite of the disturbance provoked by the exothermic reaction. 

Now consider the performances of the nonlinear observer. The continu- 
ous reactor is studied in a start stage and is not initially at equilibrium. The 
temperatures and concentrations thus tend towards this stationary state. We 
assume that only the initial measured temperature of the reactor contents is 
known with a small error. On the contrary, the large initial estimation errors 
concern both unmeasured states: the jacket temperature and the concentra- 
tion (10 K for jacket temperature, 20% for the concentration). As this is only 
simulation, the nonlinear model of the reactor represents the theoretical plant 
and it is possible to compare a theoretical variable and the corresponding esti- 
mated variable. The estimation of the temperature (Fig. 19.6), which is also 
measured, shows that the observer convergence is fast and that the deviation 
between the measured temperature of the reactor and its estimated tempera- 
ture is, of course, small, as the initial error is small and the correction uses the 
measured temperature of the reactor. The estimated jacket temperature also 
converges very rapidly towards the theoretical temperature (Fig. 19.7); this 
is explained as the reactor temperature (measured and noisy) and the jacket 
temperature are closely linked by the energy balance. The unmeasured and es- 
timated concentration (Fig. 19.7) first tends rapidly, then more slowly towards 
the theoretical concentration (Fig. 19.8). 



19.3 Biological Reactor 

19.3.1 Introduction 

Biotechnological processes are well known for their operation complexity and 
their difficult understanding related to the nature of living material. Knowledge 
models, which are, in general, non stationary, make use of many parameters and 
are strongly nonlinear. Moreover, the available measurements, in general, are 
not numerous and are of uncertain quality. However, the expansion of biotech- 
nologies imposes development of high-performance control strategies that allow 
us to efficiently monitor these processes and improve their productivity. 

To take into account the nonlinear and unsteady character of bioprocesses, 
a nonlinear control based on differential geometry and global linearization of 
the input-output behaviour seems appropriate (Isidori, 1989). 

Nonlinear geometric control has been applied to biological reactors in par- 
ticular by Hoo and Kantor (1986) and Proll and Karim (1994). 

The described application concerns an industrially important fermentation 
process constituted by a fed-batch reactor of baker’s yeast production (Lucena 
et ah, 1995, 2001). A nearly optimal production of biomass is obtained by 
prescribing to follow a low glucose concentration set point. 
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19.3.2 Dynamic Model of the Biological Reactor 

The dynamic model of the baker’s yeast production process in a fed-batch 
reactor (Fig. 19.9) is a physiological model of growth of Saccharomyces cere- 
visiae (Dantigny, 1989; Rajab, 1983). It relies on the existence of three limit 
physiological states of biomass (Fig. 19.10). 



Glucose 

Feed 



Final 

Level 



Initial 

Level 











[> 

V 


<3 



Figure 19.9: Fed-batch fermentor 




Figure 19.10: Reaction scheme (E: ethanol, S: substrate (glucose), X: 

biomass) 

These physiological states X \ , X 2 and X 3 correspond respectively to the glucose 
fermentation, glucose respiration and ethanol respiration. The model kinetic 
expressions correspond to aerobic growth, in the absence of other limitations 
than that of substrate. The state vector x has dimension 7. It is formed by 
concentrations (g/ 1 ) of ethanol yeast Xi, X 2 and X 3 , glucose 5", acetate A, 
and the reactor liquid volume V (1). 

The dynamic model of the reactor is represented by the following nonlinear 
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system, in the state space, affine with respect to the control input 



E = 



S = 

A = 
V = 



0.45 

R14 



- (1 - <P) 



i?3 






0.55 V 






X\ = R\ + i?5 + i?9 — i?4 — i?8 —Fi^ 

X2 = R2 R4 E R7 — R5 — Re — 

-^3 = Rz E Rs E Rq — R9 — R7 — 



Ri R 2 S Sin 

“ffil4 “ Offi “ V ^ ~V 

Q.iE'^ ^0.55 



(19.29) 



The control input is the glucose feed flow rate u = Fj„, the output is the glucose 
concentration in the fermentor y = S. 

R\, R 2 and R 3 are biomass production rates defined by 



Ri 



R 2 



R 3 






(0.5 + ^)(l + A) 

O. 29 X 2 S' 

(0.04 + S) 

0.25X3 r 



E 



{ 0.02 + E ) "^''"( 0.02 + . 



(^ + 0 : 2 ) 



(19.30) 



i? 4 , i? 5 , Rq, i? 7 , Rs and Rg are the transition rates from one state to another 
one defined by 



i?4 = 2 X 1 {I -a) 
OAX 2 E 



Ra = 



Rr = 



0.5 + X 
0.2 XiX 
0.2 A E 



i?5 =0.1 X 2 
2 X 3 ^ 



R7 = 



(1 - a) Rg = 



0.05 + 5^ 
0 . 5 X 36 ^ 
0.01 + 5^ 



(19.31) 



The rates i?i, i? 2 , • • •, R 9 are expressed in g/(l.h), the dimensionless coefficients 
a and (/) are defined as follows 



- 

O.ffi -\- /-IQ o9\ 

r (^ = 1 if X < 10-^ g/l (19.32) 

\ (^ = 0 ifx>io-Vi 



Thus, the model is highly nonlinear. 
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19.3.3 Synthesis of the Nonlinear Control Law 

The dynamic model (19.36) of the biological reactor is a SISO nonlinear analytic 
system, affine with respect to the control input, which can be written as 



X = f{x) + g{x)u 
y = h{x) 



(19.33) 





X\ 




’ flix) 




’ 9iix) 


X = 




fix) = 




gix) = 






_ Xn _ 




. fnix) _ 




. 9nix) _ 



(19.34) 



with 



x=[E Xi X 2 X 3 S A Vf 



(19.35) 



fix) 



0-45 „ _ ,, Rs 

014-^1 ( ‘^^ 0.55 

Ri + -R5 R9 ~ R4 ~ Rs 

R 2 R 4 R? ~ Rb ~ Rq 
R 3 Rs Rq ~ R 9 ~ R 7 
Ri R 2 



0.01 



0.14 



Ri- 



;a 

^0.55 





E - 










; gix) = 


A 

T r 




Sin-S 




\ 


- 


V 

1 



; h{x) 



0 

0 

0 

0 

5^ 

0 

0 

(19.36) 



X, y, IX, / and g are respectively the state, output, control input, vector field 
of the dynamics and vector field of the control. 



Relative Degree of the System 



The relative degree of a nonlinear system is defined by (17.55). 
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In the present case, the Lie derivatives of the output y = S are 



ip= 

L}S = 




014-^1 ( '^)o .55 

Ri + -R5 R9 ~ R4 ~ Rs 



'dS dS dS dS dS dS dS 

M ^ ^ ^ W 



Ri R2 
^ “ 0^ 



LgL^fS 



= LgS 



R2 R4 R? ~ Rb ~ Rq 



R3 Rs Rq ~ R9 ~ R? 



Ri 

0.14 



0.01 

R14" 



':R^ 



0.5 

0.55 



(19.37) 




dS dS dS dS dS dS dS 
_ ^ ^ ^ ^ W 



= 7 ^ 0 as Sin ^ S 

The relative degree of system (19.36) is thus r = 1. 



E - 

~V 

R 

S^n-S 

V 

A 

~V 

1 



(19.38) 



Synthesis of the Nonlinear Control Law 

For a system of relative degree equal to 1, according to Isidori (1995), the state 
feedback (Eq. 17.101), which makes the closed-loop input-output behaviour 
V — y linear, is 

zi — Lfh{x) 

Lgh{x) 



u = 



(19.39) 
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Figure 19.11: Nonlinear geometric control of the biological reactor with pole- 
placement 



In fact, we realize a pole-placement (Fig. 19.11) by introducing a feedback (Eq. 
17.119) such that 



V — coh{x) — Lfh{x) 

Lgh{x) 



(19.40) 



where cq is a constant scalar parameter. Again, it is necessary to modify Eq. 
(19.40) so as to take into account the reference trajectory as in Eq. (17.133). 

The external input v is dehned by means of a PI controller, which allows 
us to eliminate the stationary error eventually due to the modelling errors and 
the unmeasured disturbances, etc. 



v = K, 



1 A 

{yr{t) - y{t)) H / {yr{r) - y{T))(h 

Ti Jo 



(19.41) 



where Kc and r/ are respectively the proportional gain and the integral time 
constant. The transfer function of the closed-loop system is then the following 



T(s) ^ K.s^Kjn 

Yr{s) s2 + (co+iEe)s + iCe/r/ 



(19.42) 



The scalar parameters iEc, r/, cq are chosen in order to approach a polynomial 
minimizing an ITAE criterion; we verify that the following polynomial is Hur- 
witz (the poles have a negative real part) to ensure the closed-loop stability 
related to the roots of the characteristic equation 



+ (cq + Kc)s + Kcjrj — 0 



(19.43) 



19.3.4 Simulation Conditions 

The results have been obtained in simulation. The fermentation is realized in a 
fed-batch reactor, fed with glucose at concentration Sin = 300 g/1 (Fig. 19.9). 
The initial conditions are 

^ = 0.0 g/1, Xi = 3.0g/1, X2 = 0.0g/1, X3 = 0.0g/1 

5^ = 0.0 g/1, A = 0.0 g/1, V = 10\ 

An actual fermentation is performed in two stages. The hrst stage is run in 
batch mode and the yeast hrst consumes the glucose initially present, then 
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the ethanol that it produces. When the ethanol concentration becomes close 
to zero, the closed-loop control can start, the glucose feed is realized and the 
fermentation is then performed in fed-batch mode. 

The maximum volume maximal of the reactor is equal to 20 1. The maxi- 
mum acceptable feed flow rate is 5 1/h. 

In the case of yeast production, the set point is a glucose concentration 
equal to Sc = 0.07 g/1. Indeed previous studies have shown that this set point 
value is nearly optimal for maximizing the biomass productivity (Dantigny, 
1989). 

In order to simulate a real experience, to the ethanol and glucose measure- 
ments, a Gaussian noise of respective deviations 5 x 10“^ g/1 for ethanol and 
5x10“^ g/1 for glucose is added to the value provided by the theoretical model. 

At the initial instant, the glucose concentration is zero; immediately after- 
wards, the glucose concentration set point must follow a step equal to 0.07 g/1. 
To avoid rapid variations of the control input, the set point was filtered by an 
overdamped second-order filter (r = 5 , (“ = 1.2), so that in reality the output 
must follow this reference trajectory. 

In the case of a real fermentation, the sampling period of the glucose con- 
centration measurement is 15 minutes. The actual value of this glucose con- 
centration in the reactor is obtained about 15 minutes after the corresponding 
sampling. However, a measurement of the ethanol concentration is possible 
every 3 minutes by means of a volatile component sensor. With the control 
variable, i.e. the glucose feed flow rate, being calculated every 3 seconds, a 
state observer is necessary to reconstruct the measured states as well as the 
unmeasured ones (Bastin, 1988; Bastin and Dochain, 1990; Bastin and Joseph, 
1988; Lucena et ah, 1995; Watanabe, 1992) based on these measurements. In 
the present case, the extended Kalman filter is used with the nonlinear process 
model. Some states are initially supposed to be known with a given error. The 
initialization of the estimations was taken as 

^0 = 1-0 g/1, Xio = 3-lg/l> Wo =0-0 g/1 > Wo =0-0 g/1 
^0 = g/1 , io = 0.0 g/1 , Co = 10 1 

(19.45) 

The volume is perfectly known and for this reason is not estimated. 

The parameters used for the control law are 

Co = 5.655 Kc = 0.628 n = 0.0637h (19.46) 

To avoid the eventual windup effects due to the integral action present in 
the control law, when the deviation between set point and measurement, taken 
as an absolute value, is larger than a given value (here 0.04 g/1), the integral 
action is cut. It is reintroduced when the deviation becomes smaller than the 
fixed threshold. 

In a real process, we avoid modifying the position of the valve too frequently. 
A period of constant value equal to 3.6 s was assumed for the fermentor. 
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19.3.5 Simulation Results 

The glucose concentration (Fig. 19.12) starts from 0 g/1 and increases up to 
its set point with little overshoot, following its reference trajectory. In the 
following, the value of the measured output oscillates between 0.06 and 0.08 
g/1. It is possible to verify that the estimation always remains in the conhdence 
interval of the measured value. The oscillations visible at the end are probably 
due to the bad estimation of acetate, whose influence in the model begins to 
become sensitive. 

The ethanol concentration (Fig. 19.13) starts from 0 g/1 and goes through 
a maximum of around 0.4 g/1 before going back to 0.07 g/1, corresponding to 
the stoichiometric transformation of glucose into ethanol. This is favourable 
for good yeast productivity. 

The total biomass concentration ( Xi + X 2 + X 3 ) is represented in Fig. 

19.14. After 15 hours of fermentation, the biomass concentration reaches 33 
g/1. The desired productivity could even be more important. In the present 
case, a time limitation was imposed on the fermentation. If the latter were to 
continue, the exponential allure of biomass growth would be maintained, but a 
volume limitation would then be imposed. Beyond some biomass concentration, 
practical problems such as oxygenation of the fermentation medium occur. 

The evolution of the manipulated glucose feed flow rate is presented in Fig. 

19.15. The flow rate increases in a slow and regular manner. A remarkable 
characteristic is the exponential allure of the control prohle which demonstrates 
the highly nonlinear behaviour of this system and renders classical linear control 
very difficult. The flow rate remains largely lower than its maximum authorized 
value (30 1/h). The evolution of the liquid volume (Fig. 19.16) of course follows 
that of the feed flow rate. The constraint of maximum volume is not reached 
after 15 h. 

19.3.6 Conclusion 

The nonlinear control law designed in the framework of differential geometry 
allows us to fully exploit the knowledge of a complex process, presently the 
production of Saccharomyces cerevisiae yeast during a fed-batch fermentation. 
The desired objective is to maximize the productivity. 

The obtained control presents very good properties, although the state- 
space model of the process is highly nonlinear and non-stationary. 

One important problem is the synthesis of the state observer necessary for 
reconstructing the unmeasured states. The met difficulties are, in particular, 
important modelling uncertainties, inaccurate measurements, different sam- 
pling periods and measurement delays (Lucena et ah, 1995). 




Concentration (g/1) Concentration (g/1) 
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Figure 19.12: Measured and estimated glucose concentration 




Figure 19.13: Measured and estimated ethanol concentration 
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Chapter 20 

Distillation Column 
Control 



Distillation columns are unit operations that are very common in chemical, 
petrochemical and even sometimes in metallurgical industries. Moreover, they 
consume a large part of a plant’s total energy. The optimization of their design 
and operation is thus an essential objective. The more and more severe oper- 
ation constraints which are imposed make their mastering more delicate and 
implies impressive control strategies. 

20.1 Generalities for Distillation Columns 
Behaviour 

The purpose of a distillation column is to separate a multicomponent feed into 
products of different compositions or to purify intermediate or final products. 
The possibility of distillation relies on the volatility difference existing between 
different chemical species. If we consider a simple vessel containing a mixture 
of two components (binary mixture) and if its contents are heated, there occurs 
an equilibrium ruled by thermodynamics between liquid and vapour which then 
have different compositions: the vapour is richer in the more volatile component 
(“light”) and the liquid richer in the less volatile component (“heavy”). In a first 
approximation, the most volatile component is that which possesses the lowest 
molecular weight. Note that in the case of a multicomponent mixture, such 
as a mixture of hydrocarbons, the prediction of the respective volatilities is 
more delicate, but can be performed by most thermodynamic actual codes 
(Prausnitz and Chueh, 1968; Prausnitz et ah, 1967). Through this equilibrium, 
a separation operation called fiash is realized, which can be considered as the 
operation that occurs at the level of each tray of a distillation column. When 
the separation by a simple fiash is insufficient, the separation is operated in 
a distillation column which will allow better separation due to the series of 
trays. A packed column, in fact, behaves in an analogous manner to a real tray 
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Figure 20.1: Distillation column 



column, but in that case the notion of theoretical separation stage is used. 

The technology of distillation columns can be complex and this review is 
limited to the classical case of a column having only one feed, producing the 
bottom product at the reboiler and the distillate or overhead product at the 
condenser (Fig. 20.1), without any side withdrawal. The distillate can be 
seen as the production of “light” and the bottom product as the production of 
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“heavy”. A multicomponent mixture will thus be considered as being composed 
of two key-components, a light component and a heavy component, as if it were 
a binary system. 

Many possibilities for more complex distillation are used. Thus, the distilla- 
tion columns can have several side streams, allowing us to realize intermediate 
cuts (petroleum distillation). There can exist extractive distillation columns 
(additional solvent feed) (Lang et ah, 1995) or reactive distillation (with che- 
mical reaction) (Albet et ah, 1991). On the other hand, there exist batch 
distillation columns in which the feed is introduced into the reboiler at the ini- 
tial instant, and for which a control policy must be dehned to allow for optimal 
separation of the products with desired purities at different instants. 

Here, we are concerned with a distillation column that we can qualify as 
classical and representative of a large number of industrial columns (Fig. 20.1). 
The role of the reboiler situated at the bottom of the column is to bring to the 
whole column the energy Qb necessary for the separation operation, corres- 
ponding to the vapourization enthalpy. The condenser can be total or partial, 
depending on whether it condenses the totality or a part of the vapour arriving 
at the top of the column into a liquid that is separated between the distillate 
and the reffux. In the simplest case constituted by the total condenser, the ab- 
sorbed heat Qc (supply of cold) allows us to exactly perform the condensation 
of the head vapour. The reboiler and the condenser are, in fact, heat exchang- 
ers. The feed F can be introduced into the column at different enthalpy levels 
(subcooled liquid, saturated liquid, liquid-vapour mixture, saturated vapour, 
overheated vapour). Frequently, the feed is a saturated liquid (at the boiling 
point), which will be our hypothesis. 

A column is traditionally divided into two large sections: above the feed 
tray, the rectifying section and below the feed tray, the stripping section. In 
general, a column is operated at a fixed top pressure in order to ensure an 
effective liquid- vapour equilibrium on all the plates. A temperature gradient is 
then established in the column, the reboiler being at the highest temperature 
and the top plate at the lowest temperature. 

A column tray can be realized in different ways, e.g. a valve tray, bubble cap 
tray, sieve tray (Rose, 1985). The tray is provided with a weir (Fig. 20.2). A 
given liquid level coming from the downcomer is thus maintained on the plate. 
The vapour coming from the lower plate pushes on the valves (for a valve tray) 
and crosses the plate liquid, thus enriching itself in volatile components. On the 
other hand, the liquid passing over the weir is enriched in heavy components 
and fiows down to the lower plate. 

The column is thus crossed by a downward liquid stream and an upward 
vapour stream. The plates of the column are numbered in ascending order: the 
reboiler is 1, and the top plate n. The condenser can only be considered as 
a theoretical stage if it is partial, i.e. if it realizes a liquid-vapour equilibrium 
and for this reason is considered separately. Any plate i is thus modelled (Fig. 
20.3) by means of entering and exiting streams; those entering: the liquid 
comes from the upper plate i I and the vapour Vi-i comes from the lower 
plate i — 1; those exiting: the liquid Li and the vapour Vi have the number 




708 



Chapter 20. Distillation Column Control 




Figure 20.2: Distillation column plate 



of the plate and are considered to be in equilibrium in the modelling. In the 
calculation of a real column, an efficiency coefficient (Murphree efficiency) is 
introduced to take into account the deviation with respect to equilibrium. 



Plate i + 1 





Plate i 






Plate i — 1 





Figure 20.3: Plate model 



20.2 Dynamic Model of the Distillation Column 

The modelling of a distillation column can be performed at different complexity 
levels with regard to the objective. Models called rigorous (Skogestad, 1992, 
1997) concern models which include the mass and energy balances at each 
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stage, a dynamic model of the liquid flow and a model of the pressure dynamics, 
possibly a model of the reboiler and the condenser. In fact, even in a rigorous 
model, simplihcations are introduced, such as the perfect mixing at each stage 
and the thermal and thermodynamic equilibrium between phases (a Murphree 
efficiency per plate may be introduced to correct for non-equilibrium). 

With the considered distillation having Uc components, the equations for 
each theoretical stage i of the column include: 

• The mass balance per component j {uc equations): 

dMi Xij _ 

T Vi—iyi—ij LiXij yiViJ (20.1) 

• The energy balance (1 equation): 

^ Li+, + hU Vi-i - h\ Li - h\ Vi (20.2) 

• The thermodynamic equilibrium for each component j {uc equations): 

g(Tj, Pi, Xij) = n'j{Ti,Pi, yij) yij = k{xij) (20.3) 

where Xij and tjij respectively indicate the liquid and vapour mole fractions in 
component j at plate i. Li and Vi are respectively the liquid and vapour molar 
flow rates leaving plate i. and are respectively the liquid and vapour 
enthalpies. and /ij are respectively the liquid- and vapour-phase chemical 
potentials in component j. The global mass balance results from the Uc mass 
balance equations per component. 

The number of accumulated moles Mi and the accumulated internal energy 
Ui are calculated by using the description of the liquid holdup on the plate, thus 
its hydrodynamic behaviour. Gani et al. (1986) give many hydrodynamic corre- 
lations that allow us to calculate the liquid and vapour flow rates, the pressure 
drop, the conditions of flooding, weeping, of liquid and vapour entrainment, for 
different types of plates. 

Besides any plate i, the feed plate, the reboiler and the condenser (top 
drum) must be described according to analogous equations, but with taking 
particularities into account. For a detailed description, it is possible to refer 
to many general books of chemical engineering (Holland, 1981). However, few 
books treat dynamic problems in a detailed manner. 

A rigorous description allows us to obtain a dynamic reference model. In 
many physical cases, important simplihcations are brought to the previous 
model. Skogestad (1997) details the implications of the modelling simplihca- 
tions, concerning in particular the vapour dynamics, energy balance, and liquid 
how dynamics. The simplihed model can also be used by the control specialist 
as a nonlinear model in view of the design of a control law. 

The model described below is not a rigorous model and the differences with 
a rigorous model are underlined. On the contrary, it is well adapted to nonlinear 
control (Greff, 1992). We consider the column as a binary column, separating 
a heavy component and a light component. The retained hypotheses are: 
a) Of thermodynamic order: 
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— The liquid and the vapour are homogeneous and at thermodynamic equi- 
librium on each plate. 

— The partial molar enthalpy of a component j is independent of pressure, 
temperature and composition, thus is constant in each phase for any plate 
i. This can be applied when the components have close chemical properties. 

— The partial molar enthalpies of vapourization are identical for all compo- 
nents. 

b) concerning the column: 

— The pressure is constant on each plate, thus the hydrodynamic model be- 
comes useless. 

— The molar liquid holdup is constant on each plate. 

From all these hypotheses, it results that the molar flow rates Li and Vi will 
be constant in each section of the column, thus on each side of the feed, in 
steady-state regime as well as in dynamic regime. In the following, they are 
respectively denoted by L and V. 

An important consequence is that the energy balance equation becomes use- 
less. There remain the mass balance equations, which take simple expressions 
as the feed is considered to be binary. The expressions are written with respect 
to the most volatile component. The simplified model is written keeping the 
hypotheses of the total condenser and the saturated liquid feed: 

Condenser (^i = n -\- I = D): 



Md 



dxD 

dt 



Vyn-VXD 



Rectification plates {i = ip + 1, . . . , n): 




Lxipi + V yi-i - Lxi-Vyi 



(20.4) 



(20.5) 



Feed plate {i = ip)'- 
dx ■ 

Mip = Lxip+i + V yip-i + F ZF - {L + F)xip -V yip ( 20 . 6 ) 

Stripping plates {i = 2, . . . ,ip — !)■ 

Mi^ = {L + F) Xi+i +Vyi.i-{L + F)xi-Vyi ( 20 . 7 ) 

Reboiler {i = 1 = B): 
dx^ 

Mi^ = {LvF)x2-{LvF-V)xi-Vyi (20.8) 

It must be noted that in these equations, on each plate, the vapour phase 
mole fraction yi is a nonlinear function of the liquid phase mole fraction 
calculable by the relations of thermodynamic equilibrium (Prausnitz et ah, 
1967 ). 
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Many papers deal with the numerical aspects related to the solving of dis- 
tillation models. Indeed, a rigorous dynamic distillation model includes a large 
set of ordinary differential and algebraic equations. The problem formulation 
has important consequences on its solving (Cameron et ah, 1986). The problem 
structure makes a hollow and band matrix appear. Moreover, Cameron et al. 
(1986) distinguish two modes of simulation according to whether the column is 
operated in continuous mode (in the neighbourhood of a stationary point) or in 
discontinuous mode (startup or shutdown or feed change (Ruiz et ah, 1988)). 
In discontinuous mode, the transients are much more severe and sequential op- 
erations may occur. In continuous mode, however, we must be cautious about 
the hooding and weeping hazards inducing operating discontinuities. Cameron 
et al. (1986) evaluate the necessity of using numerical integration schemes of 
stiff systems. Generally, semi-implicit schemes of the Runge-Kutta type or 
totally implicit, based in general on Gear’s method, are recommended (Gal- 
lun and Holland, 1982; Gear and Petzold, 1984; Hindmarsh and Petzold, 1995; 
Unger et ah, 1995). 

Although the model of a packed column, either for distillation or absorption, 
is a system of partial differential equations, it is common to model such a 
column by using the notion of theoretical stage, and the problem becomes 
identical to that of a distillation column with theoretical plates. It must be 
noted that the dynamics of a packed distillation column is neatly faster than 
that of a plate column, as the liquid holdup is lower (frequently by a factor of 
two or three) (Skogestad, 1992). 

Skogestad and Morari (1988) show that the dynamic behaviour of a distil- 
lation column is that of a two time-constant system. The composition response 
to external ffow rate variations (which makes the ratio DjB vary) is close to 
that of a first-order system with time constant ri, which can be very large if 
the exit products are very pure. Gonsider the infiuence of external ffow rate 
variations {F or D or B) on the global mass balance of the column. Assuming 
that the molar holdup is constant for any plate and that the column goes from 
one initial steady state to a final steady state (subject to a variation A), the 
time constant Tc is equal to 



DfAyn + BfAxs 



(20.9) 



This time constant Tc is the contribution of the holdup inside the column, 
in the reboiler and the condenser. Equation (20.9) does not correspond to a 
linearization and allows us to obtain an excellent agreement with the nonlinear 
responses, even for high-purity columns. For binary columns, the linearized 
formula 



Me 

[Bxb{1 - xb) + Dyoil - yo)] Ins' 



with: S = 



yp{l - Xb) 

(1 - yD)XB 



( 20 . 10 ) 



provides a good approximation of Tc while demonstrating the purity infiuence 
through the separation factor S. The response to internal ffow rates variations 
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(which modify L and V while keeping D and B constant) is also first-order, but 
its time constant T 2 is much smaller than ri; however, its influence is important. 
The time constant T 2 corresponds to a variation of internal flows and is near 
Mc/F. Thus, it is possible to correctly represent the dynamic behaviour of 
composition responses by an analytical model using two time constants. More- 
over, the use of the top logarithmic compositions, log(l — yn)^ and bottom 
ones, \ogXB, nearly totally cancels the nonlinearity. The introduction of feed- 
back to control the column modifies the poles and the closed-loop responses 
can be largely faster than could be thought from the time constants ri and T 2 
corresponding to open-loop responses. 

A reduced linear model is often sought for control; however, the used sim- 
plifications can be excessive. Skogestad (1992) recommends realizing a model 
reduction from a linearized model which is itself deduced from a complete rig- 
orous model. 



20.3 Generalities for Distillation Column 
Control 



The most common objective in a distillation process is to maintain the top and 
bottom compositions at a desired specification. 



L 

(or Qb) 
Vn (or Qc) 

D 

B 







Distillation 




Column 

















xd 

Xb 

p 

Mb 

Mb 



Figure 20.4: Block diagram of a distillation column 

A typical distillation column (Fig. 20.1) can be represented as a block diagram 
(Fig. 20.4) which possesses five control inputs u corresponding to five valves 
(flow rates of distillate bottom product 5, reflux L, reboiler vapour Vi 
(indirectly manipulated by the heat power at the reboiler Qb)^ top condensed 
vapour Vn (indirectly manipulated by the power withdrawn from the condenser) 
and five controlled outputs y. Three of the controlled variables (holdup at the 
reboiler Mb and at the condenser Mb, and pressure P or vapour holdup My) 
must be carefully controlled to maintain the stability of the operation. In fact, 
Jacobsen and Skogestad (1994) show that even binary columns whose pressure 
and levels are controlled can present multiple stationary states. There remain 
two degrees of freedom for the top and bottom compositions x b and x b • The 
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other inputs are those which are not affected inside the system: the disturbances 
d and the set points y^. The disturbances in the column, in general, are related 
to the feed (ffow rate F, feed enthalpy expressed with respect to the liquid 
fraction kp and the feed composition zp)- In a general manner, disturbances 
are classified as measured disturbances (e.g. often F) and unmeasured (e.g. 
often zp). The set points can change, for example, when an on-line economic 
optimization is performed. The outputs can be measured (pressure P, holdup 
Mb and Mp, top and bottom compositions with time delay); besides, the 
temperature is generally measured at several locations. For the distillation 
column, we can consider that the state variables are the liquid and vapour 
mole fractions at the level of each plate; unfortunately, in general, they are 
not measured inside the column. The temperature measurement on sensitive 
plates allows us, by means of the thermodynamic equilibria and knowing the 
pressure, to estimate the profile of mole fractions. 

The dynamics of the five outputs are of different orders, which allows hi- 
erarchical control. Indeed, the time constants of the pressure dynamics and 
the reboiler and condenser levels dynamics are smaller, i.e. around a few min- 
utes. On the contrary, the time constants of the top and bottom mole fractions 
dynamics are largely higher, i.e. able to reach several hours. 

Problems encountered by this coupled control of top and bottom mole frac- 
tions, increased in the case of high-purity columns, are: 



— The strongly nonlinear behaviour of the process. 

— An often slow response. 

— Measurement problems (time delays for composition measurements). 

— The difficult choice of the control variables for composition control. 

— A highly interactive system. 

— The system is badly conditioned, particularly for high-purity columns. 



The example of the design of a control system for a distillation column is re- 
markable in order to show the importance and the respective domains of system 
analysis, modelling and identification, the choice of a control algorithm. This 
problem has been studied very often and many control approaches applied to 
a distillation column have been proposed along the past years. We will re- 
call some of them here which seem particularly interesting with regard to the 
methodology and evolution that they represent. The range of proposed solu- 
tions may leave more than one engineer thoughtful. Successively, single-input 
single-output control, multivariable control by decoupling (1970), by transfer 
function matrix (1975), auto-adaptive control (1981), bilinear models (1978), 
model predictive control (since 1978) and nonlinear control (1990) will be re- 
viewed. 
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20.4 Different Types of Distillation Column 
Control 

20.4.1 Single-Input Single-Output Control 

The simplest used approach consists of controlling only one composition, ge- 
nerally top yD‘ This is a single-input single-output control and, in this case, 
a flow rate is manually controlled by the operator. Taking into account strict 
specifications, dual control, which is aiming to control both the top xd and 
bottom xb mole fractions, is desirable and will be presented in the following. 
According to Skogestad and Morari (1987a), dual control allows us to spare 10 
to 30% energy by avoiding over-purification and out of norms product loss. 

20.4.2 Dual Decoupling Control 

One of the main difficulties in multivariable feedback control lies in the inter- 
action of the control loops, e.g. a modification of the reboiler vapour flow rate 
Vi to control the composition xb also influences xd (Fig. 20.5). Different so- 
lutions were proposed, e.g. using a ratio between the reflux and the condenser 
vapour to reduce interaction. Luyben (1970) studied the decoupling from a 
linear model of the column. 




Figure 20.5: Interaction inside a distillation column 



For distillation, we get a linear model expressing the outputs Y with respect 
to the inputs U and the disturbances D 



Y = GuU ^GdD 



(20.11) 



Xd 




Gu,ii 


Gu, 12 


_Xb _ 




_ Gu, 21 


Gu,22 _ 



L 

Vi 



+ 



Gd,ii 
Gd, 21 



Gd, 12 

Gd,22 





Zf 




F 



( 20 . 12 ) 



We notice that xd i^ simultaneously influenced by L and Vi, similarly for xb 
with regard to L and Vi- The block diagram (Fig. 20.6) provides the following 
equation 



Y — [I -\- Gu Gr] ^Gu Gr Y r -\- [I -\- Gu Gr] ^ Gd D 



(20.13) 
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Figure 20.6: Closed loop without decoupling 




Figure 20.7: Closed loop with decoupling 



To eliminate the interaction between the control loops, decoupling elements 
De are added which give the new scheme (Fig. 20.7) expressed by the equation 

Y = [I + GuDe Gr]~^Gu De Gr Yr + [I + G„ De Gr]~^ Ga D (20.14) 

i.e. 

Y = KiYr^ K2D (20.15) 

For the control to be non-interactive, the matrix Ki of the closed-loop transfer 
function must be diagonal and specihed. Luyben deduces the decoupling matrix 

De = [Gu]~^ [dmgGu] (20.16) 

According to Luyben, the realized experiences show the efficiency of this de- 
coupling. An application to the linear model of distillation column (Wood and 
Berry, 1973) is presented in Example 8.1. 

However, a noticeable drawback of the decoupler thus calculated is that it 
does not take into account the disturbances, e.g. in front of a feed flow rate 
disturbance, the decoupled system behaves in a worse manner compared to the 
undecoupled system. 

On the contrary, Niederlinski (1971) shows that it can be interesting to 
use interaction because it can offer a better disturbance alleviation than the 
non-interactive design. Moreover, in some cases, Luyben’s decoupler can be 
unrealizable (Weischedel and McAvoy, 1980). 




716 



Chapter 20. Distillation Column Control 



A robustness analysis by singular value decomposition concerning decou- 
pling control of distillation columns (Sect. 8.6) was realized (Arkun and Mor- 
gan, 1988; Arkun et ak, 1984). For high-purity columns, the system is badly 
conditioned, i.e. the singular values have very different orders of magnitude. 
This property explains many characteristics, and Skogestad (1992) strongly ad- 
vises against using decoupling when the model has RGA matrix elements that 
are large compared to 1. Control has been analyzed in the robustness context 
(Skogestad and Lundstrom, 1990; Skogestad et ah, 1988). 

20.4.3 The Column as a 5 X 5 System 

Skogestad and Morari (1987a) show that although a distillation column is 
strongly nonlinear, for control needs it can be described by a linear model. 
The composition responses are much slower than those of the top and bottom 
flow rate, which makes the open-loop transfer functions difficult to obtain by 
simulation. However, as the composition response depends very little on the 
levels in the reboiler and the condenser, these levels are often considered con- 
stant to find the response. Pressure has an important effect on composition, but 
is well regulated. With these approximations, the structure of the open-loop 
transfer matrix is given in Table 20.1. 



Table 20.1: The distillation column seen as a 5 x 5 system 



Controlled 




Manipulated variable 






output 


L 


H = Cl 


D 


B 


Vn 


Xd 




Gu,12{s) 


0 


0 


0 


Xb 


Gu,2i{s) 


Gu,22{s) 


0 


0 


0 




1 




1 




1 


Md 




0 




0 




Mb 


s 

- exp (—Os) 


—1 — A (1 — exp(— 6 ^ 5 )) 


s 

0 


1 


s 

0 






Mv 


s 

0 


5 

1 


0 


s 

0 


1 


s -\- kp 


s -\- kp 









The following hypotheses were retained: 

• Constant molar flows. 

• The transfer function for My is not a pure integrator because of the con- 
densation effects included in kp. 

• exp (— 6 ^ 5 ) with 0 = tl N is an approximation for 1/(1 + r^s)^, 

tl = (dMijdL^y. is the hydraulic time constant, N being the total number 
of plates. 

• A = {dLij dVi)Mi is the variation of liquid flow rate Li due to a variation of 
vapour flow rate Vi. 

• Moreover, dynamics will occur in particular for the valves modifying L, V, 
H, B and Vn. 
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We notice from the matrix of Table 20.1 that the outputs xd and xb depend 
only on the inputs L and V through a 2 x 2 sub-matrix G. Similar transfer 
matrices will occur for the disturbances. 

In general, a 5 x 5 controller will not be used, but a 2 x 2 controller denoted 
by K for composition control, plus a control system for levels and pressure. If 
the flow rates L and V are chosen as control variables for composition control, 
the control conhguration LV is represented by the following matrix 

dL K ^ 0 0 dxD 

dV 0 0 0 dxB 

dD = 0 0 cd{s) 0 0 dMo (20.17) 

dB 0 0 0 cb{s) 0 dMB 

dVfi 0 0 0 0 Cy(5) dMy 

The flow rate inputs L and V depend simultaneously on the compositions 
Xd and xb^ whereas the inputs Vn depend on only one output, respec- 

tively the holdups Md^ Mb and My. 

Instead of simply using L and V as control variables, the ratio L/V (which 
is the slope of the operating line in MacCabe and Thiele diagram) and V/B 
can be used. This recommended conhguration (Shinskey, 1984) corresponds, 
in fact, to a nonlinear control scheme. The controller becomes 




(20.18) 



We notice that the how rates L and V depend both on compositions {xd 
and Xb) and on levels {Md for L and Mb for V). When the holdup Md varies, 
the controller modihes both L and similarly for Mb and V and B. Each of 
the SISO level controllers modihes two how rates. 

Many control conhgurations are possible; among the factors that guide the 
choice (Skogestad and Morari, 1987a) are: 

— The uncertainty: the analysis by relative gain array (RGA) (Shinskey, 1984; 
Skogestad and Morari, 1987b), allows to display the input-output couplings. 
Originally, RGA is a measure of steady-state interaction and depends on 
the process model, thus is independent of the control. It has been extended 
to dynamic interaction. A drawback of RGA analysis is that it does not 
take into account the disturbances. Analogous methods considering the 
disturbances can be used in closed loop (Hovd and Skogestad, 1992). 

— The importance attached to set point tracking with respect to the inhuence 
of disturbances. 

— The dynamic considerations (level loops inhuencing the how rates L and 

V- 

— The how rate disturbance rejection (major disturbances: F, enthalpy 
E, Gi, rehux temperature Tp). 

— Manual control of only one composition. 
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— The transition from manual to automatic control. 

— The constraints (on the flow rates and the holdups: levels and pressure). 

— Level control. 

These factors can be contradictory. 

(Waller et ah, 1988) studied four control structures with regard to sensiti- 
vity: 

— The usual scheme of energy balance (L^V) with the levels (condenser and 
reboiler) controlled by D and B. 

— The material balance scheme {D, V) with the levels controlled by L and B. 

— Ryskamp scheme {D/{L -\-V)^ V) with the levels controlled hjL-\-D and 
B. 

— The two-ratio scheme {D/{L + V)^V/B) (Shinskey, 1984) with the levels 
controlled hy L ^ D and B. 

In open loop, the structure (L, V) is the less sensitive and the structure V) 
the more sensitive to the feed composition disturbances. In closed loop, the 
quality of structure (T, R) decreases while remaining acceptable. The two-ratio 
structure gives worse results than predicted by RGA analysis because of the 
pure delays present in the transfer functions. The Ryskamp scheme that pre- 
sented a RGA gain near to 1 like the two-ratio scheme gives the best results. 
The configuration V) is, in fact, equivalent to a compensator obtained by 
singular value decomposition (Bequette and Edgar, 1988): the strong direction 
(related to the largest singular value) corresponds to the modification of the 
external flow rates (D — B)^ which particularly influences the mean compo- 
sition {xd + x^)/2, while the weak direction (related to the lowest singular 
value) corresponds to the modification of the internal flow rates (L + E), which 
especially influences the composition difference {xd — xb)- 

According to Skogestad et al. (1990), who studied four configurations: (L, E), 
{D, E), {D, B), {L/D^V /B) by the frequency RGA method, the best structure 
for most columns would be the two-ratio configuration {L/D^ ftie case 

of dual control. 

The use of a feedforward controller that takes into account the flow rate F 
disturbance is attractive but difficult to apply, as the high-purity distillation 
columns are very sensitive to the inventory errors. The combination with a 
feedback controller is absolutely necessary (Skogestad, 1992). 

Note that Levien and Morari (1987) performed internal model control of 
coupled distillation columns considered as a 3 x 3 system. 

20.4.4 Linear Digital Control 

A dynamic identification of the studied column is essential for obtaining the 
discrete-time linear model of the process. In this domain, few studies are avail- 
able. Defaye and Garalp (1979) present an example of compared identification 
on an industrial depropanizer where the outputs are sensitive temperatures and 
not mole fractions. An observable disturbance is incorporated in the model. 

Defaye et al. (1983) used predictive control to control a component concen- 
tration on an internal plate of an industrial column. The proposed controller 
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is, in fact, a dead-beat one. 

Sastry et al. (1977) and Dahlqvist (1981) used the basic self-tuning con- 
troller developed by Astrom and Wittenmark (1973) to control a pilot distil- 
lation column. In the case of a single-input single-output process, the model 
used for on-line identihcation is 



^)y{t)=hB{q ^)u{t - k - 1) + e{t) 

with /c as a pure delay, e{t) a disturbance, and the polynomials 

^) = 1 ^ CLi q ^ + ... + CLfia Q 

B{q-^) =l+hxq-^ + ... + hn„q-'^” 



(20.19) 



( 20 . 20 ) 



In the single-input single-output case, the self-tuning controller algorithm 
is performed in two stages: hrst, estimation and then the calculation of the 
control law. 

Parameters are estimated by the recursive least-squares algorithm (Ljung 
and Soderstrom, 1986); hrst the parameter vector 6(t) is dehned 

9{t) = [ai,...,a„,^,6i,...,5„J^ (20.21) 

and the observation vector (j)(t) 

4>{i) = [-y(^)> ■ ■ ■ > -y{i -na + 1), hou{t - 1), ... , hou{t - nf,)]^ (20.22) 



The vector 6(t) is determined at each time step in order to minimize the crite- 
rion 

N 

Sm{ 0) =Y.e}{t) (20.23) 

i=l 

by using the following recursive equations 

9{t) = 9{t — 1) + K(t) [y{t) — bo u{t — k — 1) — (^^(t — k — 1) 9{t — 1)] 

K(t) = P{t — 1) (j){t — k — 1)[1 -\- (^^(t — k — 1) P{t — 1) (j){t — k — 1)]“^ 

P{t) = 1 {P{t - 1) - K{t)[l + -k-l)P{t- 1) 4>{t-k- l)]K^{t)} 

(20.24) 

where A is the forgetting factor. 

The control input is calculated by using the last parameter estimations 

u{t) = k[M9~^)y{t + 1 ) - y{t + 1 )] - [B{q~^)u{t) - u{t)] = e{t) 

OQ Oq 

(20.25) 

The algorithm of the self-tuning controller estimates the model parameters in 
order to obtain good regulation and normally gets close to a minimum variance 
controller. The least-squares estimations can be biased if the disturbances are 
correlated. This algorithm demands that some parameters are specihed before 
activating the controller: the controller structure and n^, the pure delay /c. 
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the sampling period Te, the scaling factor the forgetting factor A, as well as 
some initial values Pq and 6>o • 

The self-tuning controller can include a feedforward action (Dahlqvist, 1981). 
To predict the disturbance effect on the process, the process model is modified 
by including a term for the measured disturbance d according to the equation 

A{q~^) y(t) = boB{q~^) u{t — k — 1) L{q~^) d{t — k — 1) e{t) (20.26) 

with the parameters of the disturbance term defined by 

L(q ^) = Iq -\- h q ^ Im q (20.27) 

Sastry et al. (1977) used the self-tuning controller to control only the top 
composition of a pilot distillation column. Dahlqvist (1981) extended the con- 
trol to multivariable systems and chose the reboiler vapour fiow rate V and the 
refiux fiow rate R as control variables. When the top and bottom compositions 
are simultaneously controlled, the column must be considered as a multiva- 
riable system because of the coupling between the rectification and stripping 
sections. With two single-input single-output controllers in the simultaneous 
control, a controller action in a control loop would appear as a disturbance 
on the other one. The magnitude of the interaction between the loops can be 
decreased by a convenient choice of the control variables. The algorithm can be 
extended to treat multivariable systems. The process model can be formulated 
as the following model with two inputs u and two outputs y 

Vl+k+i + aAqA Vt + vl = B\qA + C\qA u\ ^ > 

yi and y 2 are the top and bottom compositions, ui and U 2 the reboiler vapour 
and refiux fiow rates. The polynomial C is analogous to polynomial A. The 
parameters of A^^ and C take into account the internal couplings in the 
column. The parameters of are larger than the parameters of (T^, as the 
coupling between the top composition and the reboiler fiow rate is larger than 
the coupling between the bottom composition and the refiux fiow rate. 

Woinet et al. (1991) described an adaptive pole-placement control on a 
pilot distillation column. Note that pole-placement is seldom used as such in 
adaptive control. 

Ohshima et al. (1991) described an industrial application of control of a 
distillation column of fatty acids presenting important disturbances. A model- 
based predictive control algorithm was modified to include the prediction of 
disturbances. 

Instead of single-input multi-output open-loop identification, Chou et al. 
(2000) demonstrate that multi-input multi-output closed-loop identification 
presents decisive advantages for developing nonlinear models (Wiener models) 
for control of high-purity distillation columns. 

20.4.5 Model Predictive Control 

Historically, distillation columns were among the first important processes to 
have derived profit of advanced control through the development of model pre- 
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dictive control (MPC) (Richalet et al., 1978). Among the significant advantages 
of MPC are the multivariable control with many inputs and outputs and the 
handling of constraints. Since that period of expansion of MPC, many indus- 
trial applications have been reported (Qin and Badgwell, 2000, 1996). It must 
be noted that probably all these applications make use of linear MPC based 
on the information provided by the responses of the controlled outputs to ad- 
equate steps on selected inputs. In this context, the well-known model of the 
distillation columns is not used except for simulation and nonlinear MPC is 
not commonly implemented, although it is mentioned (Bock et ah, 2000; Diehl 
et ah, 2001). For more details on MPC, refer to Chap. 16. 



20.4.6 Bilinear Models 

Linear models commonly used in control are obtained by linearization in the 
neighbourhood of steady state and can describe the dynamic phenomena only 
in a limited range around this operating point. Nonlinear models are not com- 
monly used in control. Bilinear models (Espana and Landau, 1978) constitute 
a compromise and allow us to describe the behaviour of columns in a wider 
domain than simple linear models. 

To justify the bilinear model issued from the knowledge model, it is ne- 
cessary to recall this latter model. The considered column is identical to that 
described by Fig. 20.1; it has n plates plus the condenser. The simplifying 
hypotheses are the following: 

HI: Liquid- vapour equilibrium is instantaneous, the efficiency is constant on 
each plate. 

H2: The influence of hydrodynamics is negligible. 

H3: The vapour holdups are zero. 

H4: The condenser is total. 

H5: The molar liquid holdups are constant on each plate. 

H6: The enthalpies are constant on each plate. 

H7: The heat losses are neglected. 

H8: The inlet and outlet flows are saturated liquid. The dynamics is described 
by the classical equations for a plate i (1 < i < n ^ 1): 

Mass balance for component j: 
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reboiler subscript 1 , condenser subscript n + 1 (not a real plate in the case of 
a total condenser). 

5ij'. Kronecker symbol (= 1 if i = 0 if i 7 ^ j) 

Three inputs are considered: two control inputs L and Qb (equivalent to 
Vi) and a disturbance F. The feed composition zf that would be a disturbance 
is taken as a parameter. The controlled outputs are xf> and xb- The system 
contains 2(n + 1) linear equations (20.30) and (20.31) to be solved with respect 
to the 2(n + 1) flow rates on the plates and the condenser. The internal flow 
rates Li and Vi depend linearly on the input vector 

vZ = [LFbF] (20.32) 

The new system can thus be written for a component j according to the vector 
notation 









Y^Bluk 



(x) + 



Mi, 



F 7^ 



(20.33) 



The state Eq. (20.29) is first-order homogeneous with respect to the control 
vector u. The vapour mole fractions yj are related to the liquid mole fractions 
Xj by the equilibrium constant kij that depends on temperature and pressure 
for a given plate i. If kij can be considered as constant through the column (at 
least piecewise), we can write: yj = hj + kj xj , and we get the bilinear relation 












^Uf ^ ^F 



(20.34) 



To linearize this expression, it would be necessary to not consider the coupling 
terms Uk xj between state and control variables. 

The expression 20.34 is first-order homogeneous both with respect to the 
control vector u and the state X] this bilinear model thus makes the internal 
flow rates Li and Vi independent of the concentrations and linearly dependent 
on the control vector. It thus corresponds to hypothesis H5. 

With the number of state variables being important, it is attractive to per- 
form a reduction of the previous model. In this case, the column is considered 
to be formed by three compartments (Fig. 20.8), one for the condenser, one 
for the rectification section and one for the stripping section. 

Using the deviation variables with respect to the steady state (exponent e) 
for state and control variables 



Suk = Uk — u% and: 5xi = Xi — x\ 



(20.35) 
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Figure 20.8: Plate aggregation 



and the hypothesis of constancy of kij^ a simplihed model is obtained 
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(20.36) 

This equation shows that the global system global includes only eight indepen- 
dent parameters allowing us to describe the dynamic behaviour of the column 
with three state variables. Equation (20.36) can be written in a more compact 
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form that is similar to to Eq. (20.34) 



6x = A 6x 



3 

Nk Suk 



Sx B Su 



(20.37) 



20.4.7 Nonlinear Control 

Several nonlinear controls of distillation columns have been published since 
about 1990. We will particularly cite Levine and Rouchon (1991); Rouchon 
(1990) as their control method is actually implemented in several industrial 
plants, which shows its interest. Rouchon (1990) applied singular perturbation 
techniques to a knowledge model similar to the previous model given by Eqs. 
(20.29) to (20.31) to obtain an aggregated nonlinear model usable in control. 
The objective is to preserve the nonlinear and qualitative properties of the 
knowledge model (steady-state gains, mole fraction in [0,1], global asymptotic 
stability) and to calculate a simple and robust nonlinear control law, rejecting 
the feed composition disturbances. 

In this model, the condenser is considered as a true plate (subscript n + 1). 
The hypotheses concerning the column are classical and close to those taken 
for the bilinear model. 

In particular, liquid and vapour are perfectly mixed and at equilibrium on 
each plate: the mass transfer time constants between liquid and vapour are 
much shorter than the residence time on each plate. 

The liquid molar holdup is constant on each plate, the pressure constant 
and uniform, the vapour molar holdup negligible on each plate: hydrodynamics, 
pressure and level dynamics are stable and sufficiently fast to be neglected. 

In general, the plate geometry and the pressure and level control loops are 
designed in a way so that these two hypotheses are satisfied for sufficiently 
smooth inputs L, E, F. 

The Lewis hypothesis of constant molar fiows in each section of the column 
is adopted; in the case of components such as hydrocarbons which have close 
physical properties, it is justified. 

While for Espana and Landau (1978) the equilibrium constant kij was con- 
stant, here it is calculated by means of Soave equation of state. Disturbances 
concern the feed: F, which is measured, and zf, not measured. 

The complete simulation model can be written in the condensed form 

Xi = fi{x^ zf) , l<i<n (20.38) 

for any rectification plate: 

fi{x,L,V,F,zp) = -^{Lxi+i +Vk{xi-i) -Lxi -Vk{xi)) (20.39) 
for the feed plate: 

1 f Lxip+i+Vk{xip-i) 

Mip Y ~{F + F) Xip — V k{xip) -\- F zp 




fip (x, L, V, F, zp) 



(20.40) 
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and for any stripping plate: 

fi{x, L, V, F, zf) = ]^ ((-^ + ^*+1 + ^ K^i-i) -{LFF)xi-V k{xi)) 

(20.41) 

The model reduction is performed by considering that it is a two time-scale 
system, one fast: 

ex^ = ^ ^ e) (20.42) 

the other one slow: 

= /^ , IX , 1C , e) (20.43) 

where (x^, x^) is the state vector, u the control vector, w the disturbance vector 
and e a small positive scalar. It is possible to keep only the slow dynamics 
corresponding to e = 0. Then, a variable change is realized in order to separate 
the column into sections of consecutive plates (compartments) and aggregate 
each section. If a section of m real plates is considered, and if ia is the subscript 
of the aggregation plate, it is possible to show that by doing the following change 
of state variables, which touches only the composition of the aggregation plate 

Xi = x{ for: (20.44) 

m \ m 

Mj Xj \ /M for: i = ia with: M = Mj (20.45) 

j=i / j=i 

we obtain a decomposition of the system of equations into a slow sub-system 
M x^ = Lxmxi + V k{xo) — L x{ — V k{x(^) , i = ia (20.46) 

0 = Lxi^i + V k{xi-i) — L Xi — V k{xi) , 1 < x < Xa — 1 and x^ + 1 < x < m 

(20.47) 

(where and yo = k{xo) are the compositions entering the section of m 

plates), and a fast sub-system 

— dx ■ 

ai M — ^ = Lxi-^i^V k{xi-i)—Lxi—V k{xi) , 1 < x < x^ — 1 andXa+1 < i < m 
dr 

(20.48) 

with a and r dehned by 

Mi=eaiM (where 0 < e <C 1) andr = t/e (20.49) 

Rouchon (1990) obtains that the dynamics of the section of m plates can be 
approximated by plates i ^ ia without holdup and one plate i = ia having the 
holdup of the section. 

Knowing that the holdups are more important in the condenser and in the 
reboiler than on the other plates, these two external plates remain unchanged 
and constitute two compartments. The rest of the column is divided into one 
to three compartments. In the case of the studied depropanizer, comparisons 
of open-loop trajectories of product compositions have shown that three com- 
partments for the internal column constitute a good compromise: 
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one rectification compartment {%r < n) (aggregation plate r) 

one feed compartment (v + 1 ^ ^ ^ — 1) (aggregation plate ip) 

one stripping compartment {2 < i < is) (aggregation plate s) 

The entire column is thus represented by a total of 5 compartments. 

The complete reduced model is thus as follows: 

Equation for the condenser compartment (subscript n 1): 



Mn+l Xn+1 = V k{Xn) ~ V Xn+1 (20.50) 

Equations for the internal column: 

Rectification (r + 1 < i < n): 

0 = Lxipi + V k{xi-i) — Lxi — V k{xi) (20.51) 

Rectification aggregation plate: 

Mr Xr = LXrXl + V k(Xr-l) — L Xr ~ V k{Xr) (20.52) 

Bottom of rectification and top of feed {ip I < i < r — 1): 

0 = Lxipi + V k{xi-i) — Lxi — V k{xi) (20.53) 



Feed aggregation plate: 

MpXip = Lxi^pi + V k{xip-i) - (T + F)xip - V k{xi^) ^ F xp (20.54) 
Bottom of feed and top of stripping (5 + l<i<ii? — 1): 

0 = (T + F) Xipi -\- V k[xi—i) — (T + F) Xi — V k(xi) (20.55) 

Stripping aggregation plate: 

Ms ±s = F) Xspi + V k{xs-i) - {L ^ F)xs - V k{xs) (20.56) 

Bottom of stripping (2 < i < 5 — 1): 

0 = (L + F) Xipi + V k{xi-i) - {L ^ F) Xi - V k{xi) (20.57) 

Equation for the reboiler (subscript 1) 

Ml ±i = {L ^ F) X2 F {L ^ F — V) xi — V k{xi) (20.58) 

The system of five differential equations has the same tridiagonal structure 
as the original system and constitutes the control system where L and V are 
the control variables, xi (= xp) and Xn+i (= xp) the outputs, F a measured 
disturbance and zp an unmeasured disturbance. In fact, it is easier to choose 
L/V and (L + F)/V as control variables. The system is solved in the same 
way. 
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If we denote the two outputs by yi 
solutions of the system 

{ dyi _ 

dt 

dy2 _ 

dt 



xi (reboiler) and 1/2 = Xn (distillate) 



01 (yi) 

02(^2) 



(20.59) 



the following condition of PI behaviour resting on the deviation between set 
point and output is imposed 



< 



P ' IP 

Tf ^ 

y2it)-y2{t) /o(y2(M) -y2(M))t^M 

ID ' T TD 



(20.60) 



The closed-loop analysis by nonlinear disturbance rejection techniques shows 
the local stability of feedback control; from the results, it seems that greater 
stability is likely. The control law depends on ( xb, Xr, Xg, xd-> F)] in fact, Xr 
and Xg are not measured and are estimated by temperature measurements at 
adequate points of the considered compartments (thermodynamic equilibrium 
equation k = f{x) and T = f{x)) in the feedback control law. The position 
of the aggregation plates r and s is not very important. Creff (1992) extended 
the results of Rouchon (1990) to multicomponent columns. 

For multicomponent mixtures, it is necessary to measure the temperature 
not only on a sensitive plate, but also at different points of the column (Yu 
and Luyben, 1984). Nonlinear observers (Lang and Gilles, 1990) have been 
developed from the knowledge model of the column; they allow us to estimate 
the temperature and composition profiles. Knowing that the movement of high 
mass transfer zones is deciding to characterize the dynamic behaviour of the 
column, it is possible to place the sensors in an optimal manner. Mejdell and 
Skogestad (1991) show that it is simply possible to estimate the compositions 
from temperature measurements and obtains the best results with a partial 
least-squares static estimator, taking into account uncertainties and different 
noises. 

The nonlinear control law by aggregation allows good disturbance rejection 
and it is not very sensitive to uncertainties and time delays due to the measure- 
ments. This nonlinear control law has been favourably compared with regard 
to stability and robustness to the linear control law obtained by the geometric 
approach of Takamatsu et al. (1979); this linear law uses the complete linea- 
rized model and is more sensitive to measurement delays (for concentrations 
obtained by chromatography) than the aggregated nonlinear model. The geo- 
metric approach of disturbance rejection (Gauthier et ah, 1981) has also been 
compared (Rouchon, 1990); in this case, the complete nonlinear model is used; 
it poses the same problem as the work of (Takamatsu et ah, 1979) in the pres- 
ence of delays. The aggregation techniques developed in this chapter in the 
frame of distillation are general and thus can be applied to a tubular reactor or 
to any chain that can be modelled as a series of several unit elements. Other 
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nonlinear studies (Bequette, 1991) have been published, using, for example, 
nonlinear model predictive control that consists of on-line optimization based 
on a knowledge model that allows us to numerically calculate the control law 
while taking into account possible constraints (Eaton and Rawlings, 1990). 

20.5 Conclusion 

Concerning the much studied physical system of the distillation column, a wide 
diversity of approaches of the control law is noticed; this diversity is partly 
related to the use of very different models going from black-box models ob- 
tained by identification to knowledge models used in their globality. Through 
the retained examples, which do not pretend to cover all this area, it can be 
observed that the more recent approaches are, in general, more complex and 
time-consuming, as they demand important intellectual investment in the de- 
sign of the control system. In return, a more stable and safer functioning, in 
a larger domain around steady state, considering the disturbances, with some- 
times the possibility of economic on-line optimization, is obtained. Many pro- 
blems remain to be solved, in particular, in the case of high-purity distillation of 
non-ideal multicomponent mixtures. The progress in computing performance 
allows us to consider solutions that integrate more and more complete know- 
ledge models. However, the handling of constraints, the robustness related to 
the modelling and measurement errors, to the measurement delays, will always 
be of great importance. 
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Chapter 21 

Examples and Benchmarks 
of Typical Processes 



In this chapter, a series of different processes that are used as examples or 
benchmarks in the literature to test control strategies are given. 



21.1 Single-Input Single-Output Processes 

21.1.1 Description by Transfer Functions 

Process with Large Time Constant and Small Delay 

The following plant continuous transfer function 

poses some problems, in particular, in discrete-time identification for further 
use of a discrete-time controller. A high closed-loop bandwidth is desired. 
Using the common following rule for the sampling period Tg 

27T , , 

Ts<— ( 21 . 2 ) 

UJb 

Lunstrom et al. (1995) mention that to obtain a truncation error lower than 5%, 
300 step coefficients are needed. However, this introduces a conflict with in- 
dustrial practice, where a lower number of coefficients is retained: 30 according 
to Cutler and Ramaker (1980). 

To demonstrate the influence of truncation, use a given discrete-time con- 
troller and analyse the response to a step input occurring at time t = 10. Lun- 
strom et al. (1995) show that a DMC controller leads to closed-loop instability 
due to the large model error. 
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21.1.2 Description by a State-Space Knowledge Model 

Van de Vusse Reactor 

The Van de Vusse reaction is operated in a continuous stirred tank reactor. It 
is often used in control literature (Chikkula and Lee, 2000; Stack and Doyle 
III, 1997). The reaction is 



A ^ B ^ C 
2A^ D 



(21.3) 



with the following material balances for components A and B 
^ = -kiCA -k3Cl + ^ {CAf - Ca) 

A A (21-4) 

-jf=k,CA-k2CB-^CB 
at V 

The nominal operating conditions are given in Table 21.1. The control pro- 
blem consists of regulating the concentration of intermediate product Cb hj 
manipulating the inlet flow rate F. Fq is the steady- state flow rate correspon- 
ding to a steady-state value of Cb = 1 mol.D^. When F increases from 0 
to 250 l.h“^, the steady-state concentration Cb first strongly increases, then 
decreases. Thus, the process is highly nonlinear with a change of sign of the 
steady-state gain. Also, when the steady-state concentration is lower than 
the maximum steady-state concentration Cb, the reactor presents an inverse 
response, thus is nonminimum-phase. This corresponds to the proposed ope- 
rating point of Table 21.1. 



Table 21.1: Nominal variables and main parameters of the Van de Vusse 
CSTR 



Reactor volume 


V = 11 


Nominal feed flow rate 


Fo = 25 l.h-i 


Feed concentration of reactant A 


Caj = 10 mol.R^ 


Kinetic constant 


ki = 50 


Kinetic constant 


k2 = 100 


Kinetic constant 


^3 = 10 l.mol“^.h“^ 



Stability Study of a Continuous Stirred Tank Reactor 

Uppal et al. (1974) have proposed a parametric model for a continuous stirred 
tank reactor which allowed them to study the occurrence of multiple steady 
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states. This reactor is modelled by the following set of equations 

= \FCAf + F{l-\)CA-FCA-Vko exp(-T) Ca 

VpCp— = pCpF{XTf + {l-X)T-T)+V {-AH) h ^M~^) Ca 



-hA{T-Tc) 

(1 — A) corresponds to the fraction of recycled outlet stream. 
In order to obtain a system in dimensionless form 

= fl{xi,X2) 

X2 = f2{xi,X2) 

the following auxiliary variables are introduced 

CAf-CA T-Tf 



(21.5) 



( 21 . 6 ) 



Xi = 

r 

I)(i = 



Cap 
V 

¥x 

ko exp (- 7 ) V 



X2 = 



t = 



, Tf \RTf 

t'FX t' 



E 



X2c = 



FX 

Tc-Tf 



= /co exp(- 7 )r 



7 = 



E 






E 



B = 






E 



V T 
{-AH)CAf 

P Cp Tf \ RTf 

hA hAr 



FXpCp VpCp 



RT 



f 



(21.7) 

At this stage, a stability study can be performed. 

Show that the stability condition for a dynamic system of dimension n 
modelled as 

X = Ax ( 21 . 8 ) 

results for a system of dimension 2 in the following condition 



det(A) > 0 ; trace of (A) < 0 (21.9) 



Show that the stationary solution (xf,X 2 ) of the system (21.6) satishes the 
equations 



X 



s 

2 



Da 



Bx{ E pX2c 
1 + /5 



1 — x\ 



exp 



-{Bx{ E ( 3 x 2 c) 



(1 E [3) H — {B x\ E Px 2 c) 
7 



(21.10) 



Plot the curve representing the dependence of x{ (ordinate) with respect 
to Da (abscissa) for given values of 5, /3, 7 , X 2 c (choose B = 14.94, (3 = 2.09, 
7 = 25, X 2 c = —2.78). Interpret this curve with respect to stability. 

From the values of Table 21 . 2 , determine whether the operating point is 
stable or unstable. 
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An Unstable Continuous Stirred Tank Reactor 

Consider again the unstable continuous stirred tank reactor studied by Uppal 
et al. (1974). Its behaviour is interesting from a control viewpoint as it is 
highly nonlinear and open-loop unstable. It has been used by Henson and 
Seborg (1993). 

The reaction is first-order A ^ B. The model is given as 



dCA 

dt 

dT 

dt 



= y “ ^a) - ko exp 

q AiJ, 

fcoexp 



E 

E 



Ca 



Ca + 



UA 

VpCp 



{Tc - T) 



( 21 . 11 ) 



The manipulated variable is the coolant temperature Tc and the controlled 
variable is the reactor temperature T. Ca is the effluent concentration. 

Tf and q are respectively the feed concentration, temperature and flow rate. 
Table 21.2 gives the values of the nominal variables and main parameters. 



Table 21.2: Nominal variables and main parameters of CSTR defined in 
(Henson and Seborg, 1993) 



Feed flow rate 

Feed concentration of reactant A 
Feed temperature 
Reactor volume 
Heat transfer coefficient 
Preexponential factor 
Reduced activation energy 
Heat of reaction 
Density of reactor contents 
Heat capacity of reactor contents 
Coolant temperature 
Concentration of reactor contents 
Reactor temperature 



q = 100 l.min ^ 

CAf = 1 mol.R^ 

Tf = 350 K. 

U = 100 1 

UA = 5 X 10^ J.min“^.K“^ 
ko = 7.2 X 10^^ min“^ 

E/R = 8750 K 

—AH = 5 X 10^ J.moR^ 

p = 1000 g.1-1 

Cp = 0.239 J.g-bK-i 
Tc = 311.1 K 

Ca = 9.3413 x 10-^ mol.r^ 
T = 385 K 



Henson and Seborg (1993) show a severe nonlinear behaviour by comparing the 
open-loop responses of Ca to ±5 step changes in Tc. In this reactor, they have 
applied different control strategies. In particular, they exactly discretized the 
previous continuous model with a sampling period Tg = 0.1 min and obtained 
the discrete-time transfer function 



G{z) 



1 0.2801 - 0.0853^-1 

1 - 1 . 592^-1 - 0 . 7801^-2 



(21.12) 



They then compared the performances of internal model control (IMC) and of 
nonlinear IMC. For IMC, they used a first-order filter with constant a = 0.368, 
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corresponding to a continuous filter of time constant 0.1 min. The perfor- 
mances were compared for ±10 K step changes in the set point and for —25% 
unmeasured step disturbance in the feed flow rate q. 



A Chemical Reactor 

Let us take again the chemical reactor modelled by equations (19.9) in Chap. 
19. Consider only the thermal behaviour of the reactor, in the absence of 
chemical reaction, expressed by the two equations of energy balance leading to 
the derivatives of the reactor and jacket temperatures 



m — = FopCp{To -T)-UA{T- T,) 
cJT^ ' 

Vj Pi Cpi ± = Fj Pj Cpj {Tjpn - Tj) +UA{T- Tj) 



(21.13) 



as well as the linear dependency of the inlet temperature in Aie jacket 
with respect to the position a of the three-way valve according to 



T,„n = ceTe±(l-o)T/ (21.14) 

Then, the expressions of the Laplace transforms of the temperature T of the 
reactor contents and the jacket Tj can be obtained. The manipulated variable 
is either oi a according to the desired degree of subtlety. From a block 
diagram for the reactor temperature control, the relations between T and Tj 
can be emphasized. Lastly, different types of control from PID to more efficient 
ones can be performed. 



A Simplified Heat Exchanger 




Figure 21.1: Heat exchanger 
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Consider the heat exchanger represented by Fig. 21.1. We wish to control 
temperature Tc of the cold fluid (subscript “c”) by manipulating the mass flow 
rate of hot fluid (subscript “h”). The temperatures Th^in (inlet of hot fluid) 
and Tc^in (inlet of cold fluid) are likely to change as well as the mass flow 
rate of cold fluid Fc. In order to simplify the study, it is assumed that the 
outlet temperature of the hot fluid is also the same at any point of the 
heat exchanger (thus considered from this point of view as a perfectly stirred 
reactor). The same hypotheses are taken for the outlet temperature Tc of the 
cold fluid. A global heat transfer coefficient h is considered between the tubes 
(for the cold fluid) and the shell (for the hot fluid). The steady-state data are 
given in Table 21.3. 



Table 21.3: Steady- state values of the different variables for the heat ex- 
changer 



Variable 


Symbol 


Value 


Hot flow rate 


Fh 


29.68 kg.s“^ 


Cold flow rate 


Fc 


37.32 kg.s-i 


Hot temperature at the inlet 


Th,in 


420 K 


Hot temperature at the outlet 


Th^out 


380 K 


Cold temperature at the inlet 


T 

-J- c,^n 


295 K 


Cold temperature at the outlet 


Tc^out 


330 K 


Density of the hot fluid 


ph 


900 kg.m“^ 


Density of the cold fluid 


pc 


800 kg.m“^ 


Mass of hot fluid in the exchanger 


rrih 


180 kg 


Mass of cold fluid in the exchanger 


me 


640 kg 


Heat capacity of the hot fluid 


Cph 


2200 J.kg-i.K-i 


Heat capacity of the cold fluid 


Cpc 


2000 J.kg-i.K-i 


Global heat transfer coefficient 


h 


740 W.m-2.K-i 


Exchange surface 


A 


70.6 



Show that the analytical equations of the balances of this heat exchanger lead 
to the following linearized model (21.15) given in numerical form 

- -0.5517 - 0.5883 - 0.4117 - 

Tc — Fc T Tc in T T}i 

10.095+1 10.095 + 1 ’ 10.095+ 1 15i 

- 0.7487 - 0.5555 - 0.4445 - 

Th — Fj^ + T}i in + Tc 

3.3695 + 1 3.3695+ 1 ’ 3.3695+1 

Considering the role played by the temperatures Th and Tc, the equations of 
this heat exchanger present a great analogy with the jacketed chemical reactor 
(Eqs. 21.13) considered with respect to the temperatures T of the reactor 
and Tj of the jacket. It is interesting to demonstrate this analogy by a block 
diagram including the control elements. 

Assume that an actuator is used with its transfer function being a pure gain 
Ka = 0.1 and a sensor whose transfer function is also a pure gain Km = 0.02. 
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On the other hand, the sensor is placed in the outlet pipe of the cold fluid 
which introduces a delay equal to 4 s. The control of this heat exchanger can 
be performed in many ways. 



A Fed-Batch Bioreactor 



Fed-batch bioreactors offer numerous possibilities of study in dynamic opti- 
mization, state estimation and control. Thus Rodrigues and Filho (1996) stud- 
ied a fed-batch bioreactor of penicillin production and tested it with predic- 
tive DMC control. The bioreactor cited hereafter is used by Srinivasan et al. 
(2002a) to demonstrate the maximization of the concentration of product P 
(also penicillin) for a given hnal time. X is biomass, S is substrate and F is 
the manipulated feed flow rate. The model equations are 



X = m(5)X--X 

^Ji{S)X ^X F 
Yx Yp^V ^ ^ 

P=.x-^p 

V = F 



(21.16) 



with 



m(^) 



fJjm S 

KmFSFSyK, 



(21.17) 



and the model parameters, the initial values and the operating constraints are 
given in Table 21.4. 



Table 21.4: Model parameters, initial values and operating constraints for 
the fed-batch bioreactor cited by (Srinivasan et ah, 2002a) 



Constant 


IJ^m = 0-02 l.h ^ 


Constant 


= 0.05 g.l-i 


Inhibition constant 


Ki = 5 g.l-i 


Yield 


Yx =0.5g[X].g-i[^] 


Yield 


Yp = 1.2 g[P].g-i[^] 


Parameter 


p = 0.004 l.h-i 


Initial biomass concentration 


Xo = 1 g/1 


Initial substrate concentration 


^0 = 0.5 g/1 


Initial product concentration 


-Po = 0 g/1 


Initial volume of the reactor 


Vo = 150 1 


Substrate concentration in the feed 


Sin = 200 g/1 


Final time 


tf = 150 h 


Constraints on the feed flow rate 


0 < F < 1 l.h-i 


Constraints on the biomass concentration 


X < 3.71.h-i 
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Srinivasan et al. (2002b) treated the same example, but with different pa- 
rameters and also different operating conditions. They also provide several 
examples of various processes in view of dynamic optimization. 

Another model for batch penicillin production, including the influence of 
pH and temperature, is given by Birol et al. ( 2002 ). 



21.1.3 Description by a Linear State-Space Model 

A Continuous Stirred Tank Reactor 



This reactor (Ramirez, 1994) has been used (Choi et ah, 2000) to demonstrate 
constrained linear quadratic optimal control. The concerned reaction is the de- 
composition of hydrogen peroxide H 2 O 2 into liquid water and gaseous oxygen. 
Temperature and hydrogen peroxide concentration in the reactor are the two 
states. Temperature is directly measured and hydrogen peroxide concentration 
can be estimated from the reaction rate by monitoring gaseous oxygen evolu- 
tion rate. The manipulated input is a cooling valve control signal. This input 
is constrained in the range [— 1 , 1 ]. 

The linearized continuous state-space model is 



-0.0556 -0.05877 

0.01503 -0.005887 



-0.03384 



or the following discrete state-space model is 



x{k + 1 ) 



0.8347 -0.1811 ,,, 0.0108 

0.0463 0.9754 -0.1137 



(21.19) 



21.2 Multivariable Processes 

21.2.1 Matrices of Continuous Transfer Functions 

A Subsystem of a Heavy Oil Fractionator 

This process is a top 2 x 2 subsystem of the heavy oil fractionator modelled 
in the Shell Standard Control (Prett and Garcia, 1988). It has been used by 
Zafiriou (1990) and Vuthandam et al. (1995). The process transfer function 
matrix is 

4.05 exp(— 27s) 1.77 exp(— 28s) 

505 + 1 605 + 1 

5.39 exp(— 185 ) 5.72 exp(— M 5 ) 

505 + 1 605 + 1 

where the time unit is minutes. Vuthandam et al. (1995) used a sampling 
interval of 4 min and the following constraints 

-3.0 < Au2{k) < 3.0 
-5.0 < U2{k) < 5.0 
-0.5 < yi{k^7) < 0.5 

with set points: yf=y 2 = 0-0 and disturbances: di = 1.2 and ^2 = —0.5 . 



(21.20) 
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A Three-product Distillation Column 



This example of a three-product distillation column (Deshpande, 1989) has 
been used (Al-Ghazzawi et ah, 2001) for testing model predictive control. This 
column separates an ethanol-water feed into ethanol at two locations, overhead 
and sidestream, and water at the bottom. 

The dynamic model is 





"0.66 exp(— 2.65) 


—0.61 exp(— 3.55) 


—0.0049 exp(— 5) 








6.75+1 


8.645 + 1 


9.065 + 1 




P -| 


yi 

V2 

y^. 




1.11 exp(— 6.55) 


—2.36 exp(— 35) 


—0.012 exp(— 1.25) 




Ui 

U2 

+ 3 _ 




3.255 + 1 


55 + 1 


7.095 + 1 






— 34exp(— 9.25) 


46.2 exp(— 9.45) 


(10. 15 + 0.87) exp(— 5) 






L 8.155 + 1 


10.95+1 


73.13s2 + 22.7s + 1 J 





0.14exp(— 125) 
6.25 + 1 



(-0.0295 - 0.011) exp(-2.665) 
114.8552 + 22.845+1 



+ 



0.53 



exp(— 10.^ 

6.95 + 1 

— 11.54 exp(— 0.65) 
7.015+1 



(-0.06275 - 0.0032) exp(-2.665) 
65.1752 + 16.235 + 1 



0 



d\ 

dj2 



(21.21) 

yi is the ethanol overhead mole fraction, ^2 is the ethanol sidestream mole 
fraction and ys is the temperature at tray number 19 (°C). ui is the overhead 
reflux flow rate (l.s“^), U 2 is the sidestream draw-off flow rate (l.s“^), us is 
the reboiler steam pressure (kPa). di and ^2 are respectively the feed flow 
rate (hs“^) and feed temperature disturbances (°C). A typical sampling time 
Tg = Is can be used. However, this is likely to pose problems due to the large 
time delays. 



A Lime Kiln 



The lime kiln is a continuous endothermic reactor in a pulp mill where calcium 
carbonate is transformed into calcium oxide by means of the heat produced by a 
flame at the end of the kiln bed. Because it is one of the most energy-consuming 
processes, its control is particularly interesting. The model is given by Zanovello 
and Budman (1999), who used a variant of model predictive control for this 
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system. The transfer matrix is 







■ 80exp(— 25 ) 


l.lexp(— 155 ) 




^hot 

^cold 





152.55+1 
— 13.65 exp(— 25 ) 


(40s + l)(50s + 1) 
0.3 exp(— 2s) 




^air 

'^gas 


O 2 




6.55+1 


IOO 5 + 1 








_ -0.72 


-0.0075 








r 0.65 exp(— 1495 ) 


1 





255 + 1 



+ 



-0.785 exp(-255) 
82.55+1 



. ^feed . 



-0.011 exp(-255) 



335 + 1 



( 21 . 22 ) 

The front-end temperature must be sufficiently high to ensure complete 

conversion, but not excessively to avoid damage to the refractory lining. The 
cold-end temperature Tj^old t>e too low, to avoid agglomeration of 

the feed, and not too high, to avoid damage to the inner mechanical system 
of chains. The oxygen concentration O 2 in the exit gases is an indicator of 
correct fan power if excessive and also of H 2 presence if low. The manipulated 
variables are the air and natural gas flow rates, respectively and ixgas- The 
disturbance is the feed flow rate 

Nominal operating conditions are: = 1000 °C, = 200 °C, 0.6 < 

O 2 < 5%, = 3mm Hg absolute pressure (approximately 2500 ton/day 

of air), ixgas = 1100 m“^h“^, MUD feed = 800 ton/day, percentage of CaO 



recovery = 90%. 

A robustness study can be performed by assuming the uncertainty tabulated 
in Table 21.5 in the parameters. 



Table 21.5: Parameter uncertainty for lime kiln (Zanovello and Budman, 
1999) 



u 


Gain K 


Delay td 


^hot/^air 


75 < K < 85 


1 < td < 3 


Tot /"“gas 


1 < K < 1.2 


13 < td < 17 


^cold/^air 


-15 < K < -12 


0.1 <td< 0.3 


^cold/^g^s 


0.2 < K < 0.4 


1 < td < 3 


^2/'^air 


-0.74 <K < -0.7 




O 2 / Tgas 


-0.008 <K < -0.007 





A typical sampling time is Tg = 5min. The tracking can be studied for a set 
point change of 10 °C in The disturbance rejection can be tested for a 

step disturbance of 80 ton/day in MUD feed. 
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The following cost function can be defined 

J= ||r[y(fe + l|fe)-y-/(fe + l)]|| + ||AAn(fe)f+ 

\\wj[y{k ^ l\k) - ^ 1)\\ ^ ‘ ^ 

The previous cost function is different from the one used by (Zanovello and 
Budman, 1999) in the penalty term. T and A are diagonal weight matrices, 
is the reference trajectory, y^^'^ is the constraint. 

The constraints given as variations with respect to the steady state are: 
if —0.6% < AO 2 < —0.4% with penalty weight: w = 50, 
if AO 2 < — 0 . 6 % with penalty weight: w = 200 , 

|Tpotl < 20 ° C with penalty weight: w = 3. 

Shell Control Problem 

The Shell control problem (Prett et ah, 1988) concerns a fractionator of heavy 
oil. The system possesses three manipulated variables (heat duty, side draws), 
seven outputs among which two controlled outputs ( top and side draw compo- 
sitions), four secondary outputs (temperatures which can be used for inferential 
control) and a seventh output to be maintained above a given value, and two 
unmeasured disturbances (top and intermediate reff ux) . This problem has been 
treated by several searchers among whom (Velez, 1997; Yu et ah, 1994). The 
model and its uncertainties are given in (Prett et ah, 1988). The control objec- 
tives are to maintain the controlled outputs at specifications, to minimize 1 x 3 
in order to maximize the heat recovery, to reject the disturbances, to obtain 
a certain closed-loop speed response, to respect constraints on the inputs, the 
input velocities, and on some outputs. 



21.2.2 Description by a Linear State-Space Model 

An Unstable Open-Loop System 

Cheng (1989) considered the following discrete two-input two-output system 



x{k + 1 ) 

y{k) 



- 1.1 0.1 



0.2 

1 0 

0 1 



-1.3 



x{k) 



x{k) + 



0.1 

0 



0.2 

0.1 



u{k) 



(21.24) 



to test linear quadratic model algorithmic control. Note that this system is 
open-loop unstable. 



Two-bed Exothermic Catalytic Reactor 

Originally, this two-bed reactor was described by a set of nonlinear partial 
differential equations which have been later linearized and reduced. The model 
used (Foss et ah, 1980) is this reduced linear state-space form. 
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Co 




Ti T2 Ts Q Tq Ts To Tr 



Figure 21.2: Decomposition of the process with inputs, internal variables and 
outputs 



The decomposition of the process as shown in Fig. 21.2 allows us to understand 
the uncommon form of the linear state-space model 



Feed heater: 



Bed I: 



Mixing chamber: 



Bed II: 



To = --To + -T/ 



= A^x^+ 



Ti 

T2 

Ts 

TI 

out 

^ out 



To 

Co 



= C^ x^ + 



To 

Co 



Tm 

C„ 

Q 



= M 



Q 

Tq 

TI 

out 

^ out 



x" = A" x" +B" 



To 

ji. I — 

Tout 
Cout 



Tm 

C„ 

Q 



= C^^x^^ +D 



II 



Tm 

c„ 

Q 



(21.25) 



(21.26) 



(21.27) 



(21.28) 



The data concerning the different matrices in the vicinity of the nominal steady 
state are given by Foss et ah (1980). The manipulated inputs are Tf, Q and 
Tq- Co is a disturbance. The controlled outputs are Tout and Cout- and 
have dimension 7. In this article, the authors used the characteristic loci to 
design their compensators. They prefered that method to LQ control because 
of interpretation. It is clear that such a problem allows the comparison of many 
of the different methods commented on in this book. 
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21.2.3 Description by a State-Space Knowledge Model 

A Semi-batch Reactor 

Chin et al. (2000) use the model of a jacketed semi-batch reactor where exother- 
mic series-parallel hrst-order reactions occur 



A + B ^ C 
B + C ^ D 

The reactor model is 

dT _ Q feed ^rji _ ^ ^ 

dt ~ V ^ ^ VpCp 

^ = -^^CA-hCACB 
dt V 



ri = ki Ca Cb 
V2 = k2CB Cc 



{T-Tj)-^kiCACB 

pCp 

= 0 ) = Ca,o 



(21.29) 



pCp 



k2 Cb Cc 



= ^4^{CBjeed - Cb) - ki Ca Cb - k2 Cb Cc 

_c_ ^ _ Qf^d ^ 

dt V 

^^=Qfeed 



(21.30) 

with the following initial conditions: T( 0 ) = Tq, CA{t = 0 ) = Ca,o^ CB{t = 
0) = 0, Cc{t = 0) = 0, y(t = 0) = Vb and the parameters dehned in Table 
21 . 6 . The kinetic constants follow the Arrhenius law: ki = exp(— ^^/(i?T)). 



Table 21.6: Initial variables and main parameters of the jacketed semi-batch 
reactor 



Initial reactor volume 

Feed flow rate 

Feed temperature 
Initial temperature 
Initial concentration of reactant A 
Feed concentration of reactant B 
Reduced heat transfer coefficient 
Preexponential factor 
Preexponential factor 
Activation energy 
Activation energy 
Reduced heat of reaction 
Reduced heat of reaction 
Sampling period 



Vb = 50 1 

_ / ^ if t < 31min 

Q^eed(t) ift>31 
Tfeed = 308 K 
To = 298 K 
Ca,o = 1 mol.R^ 

CBjeed = 0.90 or 0.95 mol.R^ 

UA/{pCp) = 0.375 l.min“^ 

kio = 5.0969 X 10^^ l.mol“^.min“^ 

/c 2 o = 2.2391 X 10^^ l.mol“^.min“^ 

Ei/R = 12305 K 

E 2 /R = 13450 K 

/AHi/{pCp) = -28.5 K.l.mol-i 

^H2lpCp) = -20.5 K.l.mol-i 

T 5 = 1 min 
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Figure 21.3: Temperature profile to be followed and operations for the semi- 
batch reactor 



For such semi-batch reactors, a typical profile (Fig. 21.3) corresponds to the 
one given by Chin et al. (2000). The reactant B is loaded only between t = 30 
min and the batch terminal time, which is fixed at t = 100 min. During the 
reaction period, it is assumed that the concentration of A is sampled every 10 
min and measured with a delay of 5 min. The desired product is C and the 
objective is production of C equal to V{tf)Cc{tf) = 42 mol. The manipulated 
variables are the jacket temperature Tj and the flow rate Qb of B. Several 
constraints are imposed 



298 K < Tj{t) < 318 K 

0.5 l.min“^ < Qb < 1.5 l.min“^ 



(21.31) 



This reactor model can be used for different studies: open-loop optimal con- 
trol, model predictive control, observer estimation and batch reactor strategies. 



A Cascade of two CSTRs 

Gobin et al. (1994) describe a cascade of two polymerization CSTRs of in- 
dustrial interest to which they applied model predictive control as DMC. This 
process is open-loop unstable and each reactor exhibits three steady states as in 
the polymerization reactor in Sect. 3.2.3. The reaction is the homopolymeriza- 
tion of styrene for which the kinetic scheme and data of Kim et al. (1990, 1991) 
are used. The temperature control is performed by manipulating the flow rates 
in the reactor jackets. By use of the polymerization reaction model, polymer 
properties such as average chain length and polydispersity can be calculated 
as well as the polymer concentration. Constraints are imposed on the coolant 
flow rates and their rate of change. 
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21.2.4 Continuous State-Space Models 

Tennessee- East man Benchmark 

The Tennessee-Eastman benchmark concerns a plant workshop which has been 
made available to the academic community to evaluate process monitoring, 
fault diagnosis and process control strategies. A detailed description is given 
by Downs and Vogel (1993). The plant produces two products and two by- 
products from four reactants. The process has hve main unit operations: 
reactor, product condenser, vapour/liquid separator, recycle compressor and 
product stripper. The model of the process contains 50 differential equations, 
nonlinear and coupled. The open-loop simulation code for the process is writ- 
ten in Fortran and is available, for example, on the following web sites: 
http://brahms.scs.umc.edu (accessed October 2003) or 

http://depts.washington.edu/control/LARRY/TE/download.html (accessed Oc- 
tober 2003) . 

The process is open-loop unstable. 

It has been used by many searchers (Chen, 1997; Duvall and Riggs, 2000; 
McAvoy and Ye, 1994; Ricker, 1995; Ricker and Lee, 1995a,b; Sriniwas and 
Arkun, 1997; Zheng, 1998) 



Benchmark of Wastewater Treatment Plants 

The control of wastewater treatment plants is difficult because of frequent and 
important changes of load in ffow rate and in quality and also to the biological 
processes which are the fundamentals of the plant operation. 

The International Water Quality Association and COST 624 group have 
established knowledge models representing the behaviour of wastewater treat- 
ment plants that can be used to test estimation, diagnostic and control strate- 
gies. COST 624 group published a benchmark (Alex et ah, 2002; Carlsson and 
Rehnstrom, 2002; Pons et ah, 1999; Vrecko et ah, 2002), also available on the 
following web site: 

www.ensic.inpl-nancy.fr/COSTWWTP/ (accessed October 2003) 

The simulated wastewater treatment plants possesses a series of five reac- 
tors, the first two ones being mixed and non-aerated, the three following ones 
simply aerated; this group is followed by a secondary settler. Two recycle 
streams complete the process. The model of the biological process is ASMl 
of lAWQ and includes 13 components and eight reaction processes. Typical 
feed disturbances for dry, stormy or rainy weather are available as representa- 
tive files of 14 days with a sampling period of 15 min. Performance criteria 
have been established concerning the effluent quality; constraints correspon- 
ding to the operating norms are imposed on the effluents and operating costs 
are proposed. 




748 



References 



References 

A. Al-Ghazzawi, E. Ali, A. Nouh, and E. Zafiriou. On-line tuning strategy for 
model predictive controllers. J. Proc. Cont.^ 11:265-284, 2001. 

J. Alex, J.E. Beteau, J.B. Copp, J. Dudley, R. Dupont, S. Gillot, U. Jepps- 
son, J.M. LeLann, M.N. Pons, and P. A. Vanrolleghem, editors. The COST 
Simulation Benchmark. Description and Simulator Benchmark. European 
Gommunities, 2002. ISBN 92-894-1658-0. 

G. Birol, G. Undey, and A. Ginar. A modular simulation package for fed-batch 
fermentation: penicillin production. Comp. Chem. Engng.^ 26:1553-1565, 
2002. 

B. Garlsson and A. Rehnstrom. Gontrol of an activated sludge process with 
nitrogen removal - a benchmark study. Water Science and Technology., 45 
(4-5):137-142, 2002. 

H. Ghen. Stability and Robustness Considerations in Nonlinear Model Predic- 
tive Control. PhD thesis, Stuttgart University, 1997. 

G.M. Gheng. Linear quadratic-model algorithmic control method: a controller 
design method combining the linear quadratic method and the model algo- 
rithmic control algorithm. Ind. Eng. Chem. Res.^ 28:178-186, 1989. 

Y. Ghikkula and J.H. Lee. Robust adaptive predictive control of nonlinear 
processes using input-output models. Ind. Eng. Chem. Res.^ 39:2010-2023, 
2000. 

I. S. Ghin, K.S. Lee, and J.H. Lee. A technique for integrated quality control, 

prohle control and constraint handling for batch processes. Ind. eng. Chem. 
Res., 39:693-705, 2000. 

J. Ghoi, H.S. Ko, and K.S. Lee. Gonstrained linear quadratic optimal control 
of chemical processes. Comp. Chem. Engng., 24:823-827, 2000. 

G.R. Gutler and B.L. Ramaker. Dynamic matrix control - a computer control 
algorithm. In Joint Automatic Control Conference, number WP5-B, San 
Erancisco, GA, 1980. 

P.B. Deshpande. Multivariable Process Control. Instrument Society of America, 
North Garolina, 1989. 

J.J. Downs and E.E. Vogel. A plant-wide industrial process control problem. 
Comp. Chem. Engng., 17(3):245-255, 1993. 

P.M. Duvall and J.B. Riggs. On-line optimization of the Tennessee Eastman 
challenge problem. J. Proc. Cont., 10:19-33, 2000. 

A.S. Eoss, J.M. Edmunds, and B. Kouvaritakis. Multivariable control system 
for two-bed reactors by the characteristic locus method. Ind. Eng. Chem. 
Eundam., 19:109-117, 1980. 




Process Control 



749 



F. Gobin, L.C. Zullo, and J.P. Calvet. Model predictive control of an open- 
loop unstable train of polymerization reactors. Comp. Chem. Enqnq., 18: 
s525-s528, 1994. 

M.A. Henson and D.E. Seborg. Theoretical analysis of unconstrained nonlinear 
model predictive control. Int. J. Cont., 58(5): 1053-1080, 1993. 

K.J. Kim, K.Y Choi, and J.C. Alexander. Dynamics of a cstr for styrene 
polymerization initiated by a binary initiator system. Polym. Eng. Sci.^ 30 
(5):279-290, 1990. 

K.J. Kim, K.Y. Choi, and J.C. Alexander. Dynamics of a cascade of two con- 
tinuous stirred tank styrene polymerization reactors with a binary initiator 
system. Polym. Eng. Sci., 31(5):333-352, 1991. 

P. Lunstrom, J.H. Lee, M. Morari, and S. Skogestad. Limitations of dynamic 
matrix control. Comp. Chem. Engng., 19(4):409-421, 1995. 

T.J. McAvoy and N. Ye. Base control for the Tennessee Eastman challenge 
problem. Comp. Chem. Engng., 18:383-413, 1994. 

M. N. Pons, H. Spanjers, and U. Jeppsson. Towards a benchmark for evaluat- 
ing control strategies in wastewater treatment plants by simulation. Comp. 
Chem. Engng., 238:403-406, 1999. 

D.M. Prett and C.E. Garcia. Eundamental Proeess Control. Butterworths, 
Stoneham, MA, 1988. 

D.M. Prett, C.E. Garcia, and B.L. Ramaker. The Seeond Shell Proeess Control 
Workshop. Butterworths, Stoneham, MA, 1988. 

W.F. Ramirez. Proeess Control and Identifieation. Academic press. New York, 
1994. 

N. L. Ricker. Optimal steady-state operation of the Tennessee Eastman chal- 
lenge process. Comp. Chem. Engng., 19(9):949-959, 1995. 

N.L. Ricker and J.H. Lee. Nonlinear model predictive control of the Tennessee 
Eastman challenge process. Comp. Chem. Engng.^ 19(9):961-981, 1995a. 

N.L. Ricker and J.H. Lee. Nonlinear modeling and state estimation for the 
Tennessee Eastman challenge process. Comp. Chem. Engng.^ 19(9): 983- 
1005, 1995b. 

J.A.D. Rodrigues and M. Filho. Optimal feed rates strategies with operating 
constraints for the penicillin production process. Chem. Eng. Sei.^ 51:2859- 
2864, 1996. 

B. Srinivasan, D. Bonvin, E. Visser, and S. Palanki. Dynamic optimization 
of batch processes II Role of measurements in handling uncertainty. Comp. 
Chem. Engng., 27:27-44, 2002a. 




750 



References 



B. Srinivasan, S. Palanki, and D. Bonvin. Dynamic optimization of batch 
processes I characterization of the nominal solution. Comp. Chem. Engng.^ 
27:1-26, 2002b. 

G.R. Sriniwas and Y. Arkun. Control of the Tennessee Eastman process using 
input-output models. J. Proc. Cont., 7:387-400, 1997. 

A. J. Stack and F. J. Doyle III. Application of a control-law nonlinearity measure 
to chemical reactor analysis. AIChE J., 43:425-447, 1997. 

A. Uppal, W.H. Ray, and A.B. Poore. On the dynamic behaviour of continuous 
stirred tank reactors. Chem. Eng. Sci., 29:967-985, 1974. 

J.C. Velez. Linear Programming Based Model Predictive Control for the Shell 
Control Problem. Master’s thesis, Purdue University, 1997. 

D. Vrecko, N. Hvala, and J. Kocijan. Wastewater treatment benchmark: what 
can be achieved with simple control? Water Seienee and Teehnology^ 45 
(4-5):127-134, 2002. 

P. Vuthandam, H. Genceli, and M. Nikolaou. Performance bounds for robust 
quadratic dynamic matrix control with end condition. AIChE J., 41(9): 
2083-2097, 1995. 

Z.H. Yu, W. Li, J.H. Lee, and M. Morari. State estimation based model pre- 
dictive control applied to Shell control problem: a case study. Chem. Eng. 
Sei., 49(3):285-301, 1994. 

E. Zahriou. An operator control theory approach to the Shell standard control 
problem. In Shell Proeess Control Workshop^ Stoneham, MA, 1990. Butter- 
worths. 

R. Zanovello and H. Budman. Model predictive control with soft constraints 
with application to lime kiln control. Comp. Chem. Engng.^ 23:791-806, 
1999. 

A. Zheng. Nonlinear model predictive control of the Tennessee Eastman pro- 
cess. In Ameriean Control Conferenee^ Philadelphia, 1998. 




Index 



Action 

derivative, 93 
integral, 90 
proportional, 86 
Actuators, 71 
Adaptation 

gain matrix, 426 
Adjoint variable, 509 
Aliasing, 333 
Analog scheme, 41 
Anti-aliasing filter, 334 
Anti- windup, 150 
AR, 392 

ARARMAX, 393 
ARARX, 393 
ARIMA, 392 
ARMA, 392 
ARMAX, 390 
ARX, 388 

Asymptotic stability domain, 106 
Attractor, 106 
Augmented function, 500 
Autocorrelation, 317, 324 
Autocovariance, 324 
Autonomous, 104 

Balance 

energy, 15 
mass, 10 

Bang-bang, 517, 518 
Bayes theorem, 413 
Bellman, 525 

Bezout equation, 167, 170, 466, 558 
Bilinear integration, 362 
Black plot, 195 
Bode 

plot, 188 

stability criterion, 204 
Boiler, 249 
Bolza, 497 
Boxcar, 329 



Brunovsky canonical form, 636 
Byrnes-Isidori canonical form, 639 

Canonical form of Jordan, 266 
Cascade, 244 
Causality, 361 

Certainty equivalence principle, 490 
Characteristic 

equation, 119 
loci, 283 

polynomial, 40, 266, 270 
Chemical reactor 

generalized predictive control with 
performance model, 566 
generalized predictive control, 561 
identihcation, 456 
internal model control, 486 
isothermal, 12, 30 
non- isothermal, 14 
nonlinear geometric control, 682 
pole-placement control, 477 
Cohen-Coon, method of, 143 
Complex numbers, 185 
Condition number, 266, 297, 422, 577 
Conditions, terminal, 500 
Control, 5 

adaptive, 488 
by ratio, 258 
composition, 136 
discrete internal model, 483 
dual, 714 
flow rate, 135 
gas pressure, 136 
generalized predictive, 555 
level, 135 

linear quadratic, 175 
model predictive, 575 
model reference, 463 
nonlinear geometric, 619 
optimal, 493 
pole-placement, 463 
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selective, 249 
split-range, 250 
temperature, 136 
variable, 66 
Controllability, 264 
Controllability, integral, 296 
Controllable 

canonical form, 265 
companion form, 265 
Controller, 67 

digital PID, 481 
feedforward, 251 
one degree of freedom, 174 
proportional, 67 
proportional-integral, 67 
proportional-integral-derivative, 68 
RST, 463 
self-tuning, 719 
two degrees of freedom, 166 
Convolution product, 26 
Corrector, 67 
Correlation analysis, 374 
Costate, 509, 513 
Covariance matrix, 660 
Cramer- Rao inequality, 414 
Criterion 

of Nyquist, generalized inverse, 284 
Bode stability, 204 
lAE, 132 
ISE, 131 
ITAE, 132 
of Jury, 359 
of Nyquist, 213 
of Nyquist, generalized, 284 
of Routh-Hurwitz, 119 
Cross-correlation, 317, 325 
Cross-covariance, 325 
Crystallizer, 153 

Damping, 46 
Decibel, 189 
Decoupling, 286, 645 
Degree (one) of freedom controller, 174 
Degree of freedom, 20 
Degrees (two) of freedom controller, 
166 

Delay, 25, 53 

Delay margin, 225 

Derivative, Lie, 628 

Design of Eeedback Controllers, 129 



Diffeomorphism, 632 
Differential geometry, 627 
Diophantine equation, 167, 170, 558 
Dirac, 6, 43 

Direct Synthesis Method, 159 
Discontinuity condition, 501 
Discrete internal model control, 483 
Discrete transfer function, 349 
Distillation, 250, 252, 258, 522, 705 
column of Wood and Berry, 287 
composition estimator, 659 
extractive, 536 

extractive column, linear quadra- 
tic control, 536 

extractive, linear quadratic Gaus- 
sian control, 544 
NMPC, 600 
robustness study, 303 
Distributed-parameter, 18 
Distribution, 631 
integrable, 631 
involutive, 631 
Disturbance, 5, 66 
model, 445 
DMC tuning, 589 

DMC, dynamic matrix control, 581 
Dominant pole, 131 
Dynamic programming, 525 
Dynamic state feedback, 635, 653 
Dynamics 

of zero, 624 
forced, 624 
internal, 624 
unforced, 624 

Eigenvalue, 40, 105, 275 
Energy, 321 

Balance, 15 
Equation 

Bezout, 466 
characteristic, 119 
error model, 389 
Hamilton- Jacobi, 502, 509 
Hamilton-Jacobi-Bellman, 529 
Lyapunov, 109 
minimal, 271 
Ergodic, signal, 324 
Error, 131 

equation, 389 
of acceleration, 133 
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of position, 133 
of velocity, 133 
prediction a priori, 385 
Euler 

conditions, 499, 500, 506 
method, 362 

Euler- Lagrange lemma, 497 
Evans locus, 121 
Evaporator, 652 
Exogenous, 389 

Expectation, mathematical, 323 
Extended Kalman hlter, 664, 687 

East Eourier transform, 321 
ECC, 600 
Eeedback, 65, 82 
Eeedback difference, 280 
Eeedforward controller, 251 
Eilter, discrete Kalman, 398 
Eirst-order, 44 

Eisher information matrix, 414 
Eluid catalytic cracking, 600 
Eocus, 106 
Eorced 

dynamics, 624 
response, 357 
Eorm 

canonical of Brunovsky, 636 
canonical of Byrnes-Isidori, 639 
canonical of Jordan, 266 
controllable canonical, 265 
controllable companion, 265 
modal canonical, 266 
normal, 623, 633 
observable canonical, 269 
observable companion, 269 
of Smith-McMillan, 282 
Eourier 

discrete transform, 320 
inverse discrete transform, 320 
series expansion, 318 
transform, 314 
Erequency 

analysis, 183 
crossover, 141 
gain crossover, 208 
phase crossover, 201 
specification, 229 
warping, 363 
Erobenius theorem, 630 



Eunction 

Lyapunov, 107 
of repartition, 323 
of sensitivity, 223 
of sensitivity, complementary, 223 
orthogonal, 318 
Eunctional, 495 

Gain, 44 

crossover frequency, 208 
margin, 207, 225 
ultimate, 136 

Generalized predictive control, 555 
Generic model control, 647 
Gershgorin circles, 284, 298 
Gibbs phenomenon, 320 
Globally linearizing control, 646 
Globally linearizing observer, 666 
GPG, generalized predictive control, 555 
Gradient, 423 
Grammian 

of controllability, 272 
of observability, 272 

Hamilton conditions, 503, 509 
Hamilton- Jacobi, 502, 509 
Hamilton- Jacobi equation, 503 
Hamilton-Jacobi-Bellman, 529 
Hamiltonian, 502 
High-gain observer, 667 
Holder, 329 
Horizon 

of control, 583 
of model, 582 
of prediction, 584 
Hurwitz, 120 

lAE criterion, 132 
Identification, 57 

state-space, 397 
Akaike criterion, 452 
IMG, internal model control, 160 
Impulse, 43 
Innovation, 385 
Innovation form, 403 
Input, 5 

Input-output linearization, 638 
Input-state linearization, 635 
Instrumental variable, 416, 443 
Integral curve, 104 
Integral saturation, 150 
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Integrator, 45, 147 
Integrity, 296 
Interaction, 286 
Internal 

dynamics, 624 
model control, 160 
model control, discrete multiva- 
riable, 307 
model principle, 220 
stability, 640 
Inverse response, 55, 241 
Involutivity, 631 
ISE criterion, 131 
ITAE criterion, 132 

Jacobian matrix, 497, 629 
Joint probability, 323 
Joint probability density, 323 
Jordan canonical form, 266 
Jury, stability criterion, 359 

Kalman 

discrete filter, 398 
gain matrix, 399 
Kalman filter, 593 

continuous-discrete, 664 
discrete, implementation, 405 
discrete-discrete, 664 
estimator, 401 
extended, 664 
predictor, 401 

Lagrange, 497 
Laplace, 21 
Laurent series, 343 
Least Squares, 409 
Least squares, 421 
recursive, 431 
recursive extended, 438 
recursive generalized, 438 
simple recursive, 429 
Legendre- Clebsch condition, 502 
Lemma matrix inversion, 400 
Lie 

bracket, 629 
derivative, 628 
Likelihood function, 413 
Limit cycle, 107 
Linear programming, 595 
Linear quadratic control, 530 
by transfer function, 175 



Linear quadratic Gaussian control, 541 
Linearization, 9, 22 
input-output, 638 
input-state, 635 
Luenberger, observer, 663 
Lumped-parameter, 10 
Lyapunov 

direct method, 107 
equation, 109, 272 
function, 107 
stability, 105 

MA, 392 
Margin 

delay, 225 
gain, 207, 225 
modulus, 228 
phase, 208, 225 
stability, 226 
Markov 

parameters, 263, 621 
series, 263 
Mason formula, 77 
Mass balance, 10 
Matrix 

inversion lemma, 400 
Jacobian, 629 
norm, 297 
normal, 302 

of transfer functions, 279 
orthogonal, 291 
orthonormal, 291, 422 
state transition, 262, 398 
Sylvester, 171 
symplectic, 550 
trace, 400 

Maximum likelihood, 412 
Maximum likelihood recursive, 439 
Maximum principle, 511 
Mayer, 497 
Mean, 323 
MIMO, 6 

Minimum principle, 511 
Minimum-phase, 104, 190, 627, 643 
Modal canonical form, 266 
Mode, 35 
Model 

AR, 392 

ARARMAX, 393 
ARARX, 393 
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ARIMA, 392 
ARMA, 392 
ARMAX, 390 
ARX, 388 
Box- Jenkins, 395 
dynamic, 7 

equation error, 388, 389 
general, 395 
MA, 392 

output error, 393 
reference, 469 
RIF, 393 
steady-state, 7 

Model predictive control, 575 
Modulus margin, 228 
Moment, 60, 323 
Moment method, 60 
Monic, polynomial, 168 
Monitoring, 659 
Moving average, 339 
Moving state estimation, 669 
MPC, model predictive control, 575 
Multivariable, 6 
control, 279 
nonlinear control, 647 

Natural response, 357 

Nichols plot, 217 

Niederlinski index, 286 

NIPALS, 661 

Node, 106 

Noise, white, 373 

Noise-spike, 339 

Nominal performance, 224 

Nonlinear multivariable control, 647 

Nonminimum- phase, 190-192 

Norm 

of a signal, 222 
of a transfer function, 222 
of matrix, 297 
Normal form, 623, 633 
Normalized time, 451 
Nyquist 

criterion, 213 

frequency, 363 

generalized criterion of, 284 

generalized inverse criterion of, 284 

plot, 192 

point, 214 

Observability, 268 



Observable 

canonical form, 269 
companion form, 269 
Observation 

vector, 389 
Observer, 657 

globally linearizing , 666 
high-gain, 667 
Luenberger, 663 
moving state estimation, 669 
polynomial, 471 
On-Off control, 151 
Operator 
(5, 366 

backward shift 365 
forward shift g, 365 
Optimal control, 493 
Optimality principle, 525 
Optimization, 57 
Order, 20 
Orthogonal 

function, 318 
matrix, 291 

Orthonormal matrix, 291, 422 
Output, 5, 66 
Output error method, 444 
Overshoot, 49 

Pade approximation, 54 
Pairing, 286 
Parameter 

adaptation algorithm, 424 
distributed, 7, 18 
lumped, 7, 10 
vector, 389 

Parametric identihcation, models, 387 
Parseval-Plancherel, 27, 316 
Partial least squares, PLS, 659 
Partial state, 42 

PGA, principal component analysis, 659 
Performance 
index, 497 
nominal, 224 
robust, 227 
pH control, 154 
Phase 

crossover frequency, 201 
margin, 208, 225 
plane, 106 
portrait, 106 
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Physical realizability, 118 
PID controller tuning, 136 
Plot 

Black, 195 
Bode, 188 
Nichols, 217 
Nyquist, 192 

PLS, partial least squares, 659 
Point 

equilibrium, 105 
singular, 105 
stationary, 105 
Pole, 35, 357 

dominant, 131 
multivariable, 282 
placement, 166, 463 
Pole-zero correspondence, 363 
Polymerization reactor. 111, 524, 653, 
654 

Pontryagin maximum principle, 511 
Prediction 

error, a posteriori, 424 
error, a priori, 424 
Predictor 

a posteriori, 424 
a priori, 424 
multi-step, 385 
one-step, 380 

Principal component analysis, PCA, 659 

Principal direction, 292 

Principle 

of internal model, 220 
of superposition, 33 
Probability density, 323 
Process, staged, 17 
Projection to latent structures, PLS, 
659 

Proper transfer function, 361 
Pseudo random binary sequence, 448 

QDMC, quadratic dynamic matrix con- 
trol, 589 
Quantization, 313 

Random signal, 322 

Random stationary signal, 324 

Rank, 265 

Rate of reaction, 13 

Reachability, 264 

Reactor 



biological, nonlinear control, 692 
catalytic, 247, 250 
chemical, 250 
chemical tubular, 18 
chemical, generalized predictive con- 
trol with performance model, 
566 

chemical, generalized predictive con- 
trol, 561 

chemical, internal model control, 
486 

chemical, isothermal, 12 
chemical, non-isothermal, 14 
chemical, nonlinear control, 681 
chemical, pole-placement control, 
477 

identihcation, 456 
polymerization. 111, 653, 654 
with jacket, 244 
Realization, 262, 271 
balanced, 273 
minimal, 272 
Reconstruction, 341 
Recursive 

least squares, 431 
maximum likelihood, 439 
prediction error, 440 
Reduction of model, 273 
Reference, 148 
model, 469 
trajectory, 148 

Regression, pseudo-linear, 392 
Regulation, 65, 88 
Relative 

degree, 122, 125, 361, 621, 629, 
648 

gain array, 292 
order, 621 

Relay oscillation method, 138 
Response 

forced, 43, 357, 559 
free, 559 
impulse, 43 
inverse, 55, 241 
natural, 43, 357 
step, 43 
RGA, 292 
Riccati 

discrete equation, 401 
equation, 533, 541, 542 
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RIF, 393 
Robust 

performance, 227 
stability, 226 
Robust ificat ion, 444 
Robustness, 173, 221, 296, 298 
Root locus, 121 
Routh-Hurwitz, 119 

Saddle point, 106 
Sampling, 329 
Sampling period, 335 
Second-order, 46 
Sensitivity 

complementary function, 299 
complementary function of, 223 
function, 223, 299 
Sensor, 69 

intelligent, soft, 657 
Separation principle, 543 
Series, Fourier, 318 
Set point, 66 
Shannon, 334 
Signal 

ergodic, 324 
random, 322 
random stationary, 324 
stochastic, 322 
stochastic stationary, 324 
Signal-flow graph, 77 
Single variable, 6 
Singular 

arc, 514, 520 
value, 291 

value decomposition, 291 
Singular value decomposition, 661 
SISO, 6 

Smith predictor, 239 
Smith-McMillan, form, 282 
Smoothing hlter, 339 
Soft 

sensor, 657 

Specihcation frequency, 229 
Spectral 

density, 317, 324 
factorization, 175 
Spectrum, 317 
Square system, 280 
Stability, 20, 103, 282, 372 
asymptotic, 105 



asymptotic domain, 106 
external, 103 
internal, 103, 640 
Jury criterion, 359 
Lyapunov, 105 
margin, 226 

polymerization reactor. 111 
robust, 226 
state- space, 104 
strict, 372 

Stabilizability, integral, 295 
State 

estimator, 657 
representation, 42, 261 
space, 39 

transition matrix, 398 
State feedback 

dynamic, 635 
static, 635 
State-space, 8 
Static state feedback, 635 
Stationary, signal, 324 
Statistical process control, 659 
Step, 43 

Stochastic signal, 322 
Stochastic stationary signal, 324 
Superposition principle, 33 
Surge tank, 10, 30 

Theorem, Cauchy, 213 
Time 

constant, 44 
delay, 53 
rising, 49 
settling, 49 

Total hnite energy, 316 
Trace of matrix, 400 
Tracking, 65, 87 
Trajectory, 104 
Transfer function 
continuous, 27 
discrete, 349 
proper, 29 
Transform 
z, 343 

fast Fourier, 321 
Fourier inverse discrete, 320 
Laplace, 21 
of Fourier, 314 

Transversality conditions, 500 
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Tuning of PID, 136 
Tustin method, 362 

Ultimate 

gain, 136, 203 
period, 137 

Uncertainty relation, 316 
Unforced dynamics, 624 

Validation, 447 
Variable 

control, 65 
controlled, 65 
deviation, 22 
manipulated, 5 
Variance, 323 
Variational method, 496 
Volterra model, 597 

Warping, 363 
Weierstr ass- Erdmann 
condition, 502 
White noise, 373 
Windup, 150 

z 

-transform, 343, 347 
Zero, 35, 357 

dynamics, 624, 639 
multivariable, 282 
Zero-order holder, 346 
Ziegler- Nichols tuning, 137 




